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DRA SEQUENCES , RECOMBINANT DMA MOLECULES 
AND PROCESSES FOR PRODUCING SOLUBLE 

T4 PROTEINS . 



5 TECHNICAL FIELD OF INVENTION 

This invention relates to DNA sequences, 
recombinant DNA molecules and processes for producing 
soluble T4 proteins. More particularly, this invention 
relates to DNA sequences that are characterized in 

10 that they code on expression in an appropriate uni- 
cellular host for soluble forms of T4, the receptor 
on the surface of T4* lymphocytes, or derivatives 
thereof. In accordance with this invention, the DNA 
sequences, recombinant DNA molecules and processes 

15 of this invention may be employed to produce soluble 

T4 essentially free of other proteins of human origin. 
This soluble protein may then advantageously be used 
in the immuno therapeutic, prophylactic, and diag- 
nostic compositions and methods of this invention - 

2 0 The soluble T4 protein-based immunothera- 

peutic compositions and methods of this invention 
are useful in treating immunodef icient patients suf- 
fering from diseases caused by infective agents whose 
primary targets are T4* lymphocytes. According to a 
25 preferred embodiment, this invention relates to solu- 
ble T4 protein-based compositions and methods which 
are useful in preventing, treating or detecting 
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acquired immune deficiency syndrome, AIDS related 
complex and HIV infection, 

BACKGROUND ART 

The class of immune regulatory cells known 
5 as T cell lymphocytes can be divided into two broad 
functional classes, the first class comprising T 
helper or inducer cells — which mediate T cell pro- 
liferation, lymphokine release and helper cell inter- 
actions for Ig release, and the second class compris- 
10 ing T cytotoxic or suppressor cells — which parti- 
cipate in T cell-mediated killing and immune response 
suppression. In general, these two classes of 
lymphocytes are distinguished by expression of one 
of two surface glycoproteins: T4 (m.w. 55,000-62,000 
15 daltons) which is expressed on T helper or inducer 
cells, probably as a aonomeric protein, or T8 (m.w. 
32,000 daltons) which is expressed on T cytotoxic or 
suppressor cells as a dimeric protein. 

The primary structures of T4 and T8 have 
20 been deduced from their respective cDNA sequences 

[P. J. Haddon et al., ••The Isolation and Nucleotide 
Sequence Of A cDKA Encoding The T Cell Surface Protein 
T4: A New Member Of The Immunoglobulin Gene Family* 9 , 
Cell , 42, pp. 93-104 (1985); D. R. Littman et al., 
25 "The Isolation And Sequence Of The Gene Encoding T8: 
A Molecule Defining Functional classes Of T Lympho- 
cytes-, Cell , 40, pp. 237-46 (1985)]. Both predicted 
pro tein sequences define molecules with domains 
expected for surface antigens, including transmem- 
30 brane and intracytoplasmic domains at the carboxyl 

"end of the protein. In addition, both proteins con- 
tain an amino terminal region which shows striking 
homology to immunoglobulin and T cell receptor 
variable regions and which might function during 
35 target cell recognition f Maddon et al; , supra ] . 



3652 



89085513 



WO 89/01940 



PCT/US88/02940 



-3- 

In immunocompetent individuals, T4 lympho- 
cytes interact with other specialized cell types of 
the immune system to confer immunity to or defense 
against infection [E. L. Reinherz and S. F. 
5 Schlossmann "The Differentiation Function Of Human 
T-Cells", Cell , 19, pp. 821-27 (1980)]. More speci- 
fically, T4 lymphocytes stimulate production of growth 
factors which are critical to a functional immune 
system. For example, they act to stimulate B cells, 

10 the descendants of hemopoietic stem cells, which 
promote the production of defensive antibodies. 
They also activate macrophages ("killer cells") to 
attack infected or otherwise abnormal host cells and 
they induce monocytes ("scavenger cells") to encompass 

15 and destroy invading microbes. 

It has been found that the primary target 
of or receptor for certain infective agents is the 
T4 surface protein. These agents include , for 
example, viruses and retroviruses. When T4 lympho- 

20 cytes are exposed to such agents, they are rendered 

nonfunctional. As a result, the host 1 s complex immune 
defense system is destroyed and the host becomes 
susceptible to a wide range of opportunistic infec- 
tions. 

25 Such immunosuppression is seen in patients 

suffering from acquired immune deficiency syndrome 
("AIDS"). AIDS is a disease characterized by severe 
or, typically, complete immunosuppression and 
attendant host susceptibility to a wide range of 

30 opportunistic infections and malignancies. In some 
cases, AIDS infection is accompanied by central 
nervous system disorders. Complete clinical mani- 
festation of AIDS is usually preceded by AIDS 
related complex ( "ARC" ) , a syndrome accompanied by 

35 symptoms such as persistent generalized lymphadeno- 
pathy, fever and weight loss. The human immunode- 
ficiency virus ("HIV") retrovirus is thought to be 
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the etiological agent responsible for AIDS infection 
and its precursor, ARC [M. G. S .rngadharan et al., 
"Detection, Isolation And Continuous Production Of 
Cytopathic Retroviruses (HTLV-III) From Patients 
5 With AIDS And Pre— AIDS M , Science , 224, pp. 497-508 
(1984)].* 

Between 85 and 100% of the AIDS/ARCS popu- 
lation test seropositive for HIV [G. N. Shaw et al., 
"Molecular Characterization Of Human T-Cell Leukemia 

10 (Lymphotropic) Virus Type III In The Acquired Immune 
Deficiency Syndrome" . Science , 226 , pp. 1165-70 
(1984)]. The number of adults in the United States 
infected with HIV has been estimated to be between 1 
and 2.5 million [D. Barnes, "Strategies For An AIDS 

15 Vaccine", Science . 233 . pp. 1149*53 (1986); M. Rees, 
•The Sombre View Of AIDS", Nature , 326, pp. 343-45 
(1987)]. These estimates include 64,900 individuals 
who do not belong to an identified group at risk for 
AIDS IS. L. SivaJc and G. P. Wormser, "How Common Is 

20 HTLV-III Infection In The United States?", New Eng. 
J . Med . , 313. p. 1352 (1985)]. The apparent annual 
rate of diagnosis for those infected with HIV virus 
is between 1 and 2% — a rate which may increase 
significantly in future years. 

25 The genome of retroviruses, such as HIV, 

con tains three regions encoding structural proteins. 
The gag region encodes the core proteins of the 
virion. The pol region encodes the virion RNA-depen- 
dent DNA polymerase (reverse transcriptase). The 



* in this application, human immunodeficiency 
virus ( "HIV" ) , the generic term adopted by the human 
retrovirus subcommittee of the International Committee 
On Taxonomy Of Viruses to refer to independent iso- 
35 lates from AIDS patients, including human T cell 
lymphotropic virus type III ( "HTLV— I I I" ) , lympha- 
denopathy-associated virus ("LAV" ), human immuno- 
deficiency virus type 1 ("HIV-1") and AIDS-associated 
retrovirus ("ARV") will be used. 
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env region encodes the major glycoprotein found in 
the membrane envelope of the virus and in the cyto- 
plasmic membrane of infected cells. The capacity of 
the virus to attach to target cell receptors and to 
5 cause fusion of cell membranes are two HIV proper- 
ties controlled by the env gene. These properties 
are believed to play a fundamental role in the patho- 
genesis of the virus. 

HIV env proteins arise from a precursor 

10 polypeptide that, in mature form, is cleaved into a 
large heavily glycosylated exterior membrane protein 
of about 481 amino acids — gp!20 — and a smaller 
transmembrane protein of about 345 amino acids which 
may be glycosylated — gp41 [L. Ratner et al. , 

15 "Complete Nucleotide Sequence Of The AIDS Virus, 
HTLV-III", Nature , 313, pp. 277-84 (1985)]. 

The host range of the HIV virus is asso- 
ciated with cells which bear the surface glycopro- 
tein T4 . Such cells include T4 lymphocytes and 

20 brain cells [P. J. Maddon et al., "The T4 Gene Encodes 
The AIDS Virus Receptor And Is Expressed In The 
Immune System And The Brain* 1 , Cell , 47, pp. 333-48 
(1986)]. Upon infection of a host by HIV virus, the 
T4 lymphocytes are rendered non- functional. The 

25 progression of AIDS/ARCS syndromes can be correlated 
with the depletion of T4 lymphocytes, which display 
the T4 surface glycoprotein. This T cell depletion, 
with ensuing immunological compromise, may be attri- 
butable to both recurrent cycles of infection and 

30 lytic growth from cell-mediated spread of the virus. 
In addition, clinical observations suggest that the 
HIV virus is directly responsible for the central 
nervous system disorders seen in many AIDS patients. 

The tropism of the HIV virus for T4* cells 

35 is believed to be attributed to the role of the T4 
cell surface glycoprotein as the membrane- anchored 
virus receptor. Because T4 behaves as the HIV virus 
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receptor, its extracellular sequence probably plays 
a direct role in binding HIV. .-lore specifically, it 
is believed that HIV envelope selectively binds to 
the T4 epitope(s), using this interaction to initiate 
5 entry into the host cell [A. G. Dalgelish et al . , 
••The CD4 (T4) Antigen Is An Essential Component Of 
The Receptor For The AIDS Retrovirus 1 *, Nature , 312/ 
pp. 763-67 (1984); D. Klatzmann et al., M T- Lymphocyte 
T4 Molecule Behaves As The Receptor For Hunan Retro- 
10 virus LAV", Nature, 312, pp. 767-68 (1984)]. Accord- 
ingly, cellular expression of T4 is believed to be 
sufficient for HIV binding, with the T4 protein 
serving as a receptor for the HIV virus. 

The T4 tropism of the HIV virus has been 
15 demonstrated in vitro . When HIV virus isolated from 
AIDS patients is cultured together with T helper 
lymphocytes preselected for surface T4 , the lympho- 
cytes are efficiently infected, display cytopathic 
effects, including multinuclear syncytia formation 
20 and are killed by lytic growth [D. Rlatzmann et al., 
" Selective Tropism Of Lymphadenopathy Associated 
Virus (LAV) For Helper- Inducer T Lymphocytes* , 
Science , 225. pp, 59-63 (1984); F. Wong-Staal and 
r. c. Gallo, "Human T-Lymphotropic Retroviruses". 
25 Nature , 317, pp. 395-403 (1985)]. It has been demon- 
strated that a cloned cDNA version of human T4, when 
expressed, on the surface of transfected cells from 
non-T cell lineages, including murine and fibroblast 
toid cells, endows those cells with the ability td 
30 bind HIV IP. J. Maddon et al., "The T4 Gene Encodes 

The AIDS Virus Receptor And Is Expressed In The Immune 
System And The Brain", Cell , 47, pp. 333-48 (1986)]. 

During the course of HIV infection, the 
host mounts both a humoral and a cellular immune 
35 response to the virus. These responses include the 
appearance of antibodies which bind to a number of 
viral products and which exhibit neutralizing effect 
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or antibody dependent cellular cytotoxic functions 
[M. Guroff-Robert et al . , "HTL\ -II I -Neutralizing 
Antibodies In Patients With AIDS And AIDS-Related 
Complex", Nature , 316, pp. 72-74 (1985); D. D. F. 
Barin et al., "Virus Envelope Protein Of HTLV-III 
Represents Major Target Antigen For Antibodies In 
AIDS Patients M , Science , 228, pp. 1094-96 (1985); 
A. H. Rook et al., "Sera From HTLV-III/LAV Antibody 
Positive Individuals Mediate Antibody Dependent 
Cellular Cytotoxicity Against ETLV-III/LAV Infected 
T Cells" , J. Immunol. , 138, pp. 1064-68 (1987)]. 
Epitopes of the HIV envelope have been identified as 
important determinants in eliciting a neutralizing 
antibody response. And, determinants in antibody 
dependent cellular cytotoxicity ("ADCC") activity 
include HIV env and, possibly, gag epitopes. 

In the absence to date of effective treat- 
ments for AIDS, many efforts have centered on preven- 
tion of the disease. Such preventative measures 
include HIV antibody screening for all blood, organ 
and semen donors and education of AIDS high-risk 
groups regarding transmission of the disease. 

Experimental or early-stage clinical treat- 
ment of AIDS and ARCS conditions have included the 
administration of antiviral drugs, such as HPA-23, 
phosphono formate, suramin, ribavirin, azido thymidine 
( M AZT W ) and dideoxycytidine, which apparently inter- 
fere with replication of the virus through reverse 
transcriptase inhibition. Although each of these 
drugs exhibits activity against HIV in vitro , only 
AZT has demonstrated potential benefits in clinical 
trials. AZT administration in effective amounts, 
however, has been accompanied by undesirable and 
debilitating side effects, such as bone marrow 
depression. It is likely, therefore, that hemato- 
logic toxicity will be a major rate limiting factor 
in the long term use of AZT. 
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Other proposed methods for treating AIDS 
have focused on the development of agents having 
activity against steps in the viral replicative cycle 
other than reverse transcription. Such methods 
5 include the administration of interferons or the 

application of hybridoma technology. Most of these 
treatment strategies are expected to require the 
co-administration of immunomodulators, such as inter- 
leuJcin-2 . 

10 To date, the need exists for the develop- 

ment of effective immuno therapeutic agents and methods 
for the treatment of AIDS , ARCS , HIV infection and 
other immunodeficiencies caused by T lymphocyte 
depletion or abnormalities. 

IS DISCLOSURE OF THE INVENTION 

The present invention solves the problems 
referred to above by providing, in large amounts, 
soluble T4 and soluble derivatives thereof that act 
as receptors for infective agents whose primary target 

20 is the T4 surface protein of T4* lymphocytes . Advan- 
tageously, this invention also provides soluble T4 
essentially free of other proteins of human origin 
and in a form that is hot contaminated by viruses, 
such as HIV or hepatitis B virus. 

25 As will be appreciated from the disclosure 

to follow, the DNA sequences and recombinant DMA 
molecules of this invention are capable of directing, 
in an appropriate host, the production of soluble T4 
or derivatives thereof. The polypeptides of this 

30 invention are useful, either as produced in the host 
or after further derivatization or modification, in 
a variety of immunotherapeutic compositions and 
methods for treating immunodef icient patients 
suffering from diseases caused by infective agents 

35 whose primary targets are T4 lymphocytes. According 
to various embodiments of this invention, such compo- 
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sitions and methods relate to a soluble receptor for 
HIV, soluble T4 proteins and pc ..ypeptides and anti- 
bodies thereto. The soluble T4 proteins and polypep- 
tides of this invention include monovalent, as well 
as polyvalent forms . 

The compositions and methods of this inven- 
tion, which are based upon soluble T4 proteins, poly- 
peptides or peptides and antibodies thereto, are 
particularly useful for the prevention, treatment or 
detection of the HIV-related infections AIDS and 
ARC- More specifically, the soluble T4-based com- 
positions and methods of this invention employ 
soluble T4-like polypeptides — polypeptides which 
advantageously interfere with the T4/HIV interaction 
15 by blocking or competitive binding mechanisms which 
inhibit HIV infection of cells expressing the T4 
surface protein- These soluble T4-like polypeptides 
inhibit adhesion between T4* lymphocytes and infec- 
tive agents which target T4* lymphocytes and inhibit 
interaction between T4* lymphocytes and antigen pre- 
senting cells and targets of T4* lymphocytes mediated 
Killing. By acting as soluble virus receptors, the 
compositions of this invention may be used as anti- 
viral therapeutics to inhibit HIV binding to T4* 
25 cells and virally induced syncytium formation at the 
level of receptor binding. 

This invention accomplishes these goals by 
providing DNA sequences coding on expression in an 
appropriate unicellular host for soluble T4 proteins* 
30 and soluble derivatives thereof. 



* As used in this application, "soluble T4 pro- 
tein", "soluble T4" and "soluble T4-like polypeptides' 
include all proteins, polypeptides and peptides which 
35 are natural or recombinant soluble T4 proteins, or 
soluble derivatives thereof, and which are charac- 
terized by the immunotherapeutic ( anti-retroviral ) 

( footnote continued on following page) 
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This invention also provides recombinant 
DNA molecules containing those -)NA sequences and 
unicellular hosts transformed with them. Those hosts 
permit the production of large quantities of the 
5 novel soluble T4 proteins, polypeptides, peptides 
and derivatives of this invention for use in a wide 
variety of therapeutic, prophylactic and diagnostic 
compositions and methods 

The DNA sequences of this invention are 
10 selected from the group consisting cf : 

(a) the DNA inserts of pl99-7, pBG377, 
P&G380, pBG381, p203-5, pBG391, pBG392, pBC393, 
PBC394, pBG395, pBG396, pBG397, p211-ll. p214-10 
and p215-7; 

15 <*>> DNA sequences which hybridize to one 

or more of the foregoing DNA inserts and which code 
on expression for a soluble T4-like polypeptide; and 

(c) DNA sequences which code on expression 
for a soluble T4-liJte polypeptide coded for on 
expression by any of the foregoing DNA inserts and 
sequences. 

According to an alternate embodiment, this 
invention also relates to a DNA sequence comprising 
the DNA insert of pl70-2, said sequence coding on 
25 expression for a T4-like polypeptide. And. this 

invention also relates to recombinant DNA molecules 
and processes for producing T4 protein using that 
DNA sequence. 



20 



30 (footnote continued from preceding page) 

or immunogenic activity of soluble T4 protein. They 
include soluble T4-like compounds from a variety of 
sources, such as soluble T4 protein derived from 
natural sources, recombinant soluble T4 protein and 
35 synthetic or semi-synthetic soluble T4 protein. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is an autoradiograph depicting 
the purification of T4 protein from U937 cells by 
inununoaffinity chromatography, 
5 Figure 2 depicts autoradiograph and Western 

blot data demonstrating that immunoaf f inity-purif ied, 
solubilized native T4 protein binds to HIV envelope 
protein. 

Figure 3 depicts the nucleotide sequence 
10 and the derived amino acid sequence of T4 cDNA 

obtained from PBL clone A203-4. In this figure, the 
amino acids are represented by single letter codes 
as follows : 

Phe: F Leu: L He: I Met: M 

15 Val: V Ser: S Pro: P Thr: T 

Ala: A Tyr: Y His: H Gin: Q 

Asn: N Lys: K Asp: D Glu: E 

Cys: C Trp: W Arg: R Gly: G 

* = position at which a stop codon is 

20 present. 

In Figure 3 , the T4 protein translation 
start ( AA_ 23 ) is located at the methionine at nucleos- 
ides 201-203 and the mature N- terminus is located at 
the lysine (AA 3 ) at nucleotides 276*278. 
25 Figure 4 is a schematic outline of the 

construction of cDNA clones pBG312.T4 (also called 
p!71-l) and p!70-2. 

Figure 5 is a schematic outline of the 
construction of plasmid pEClOO. 
30 . Figure 6 depicts amino acid comparisons at 

a positions 3, 64 and 231 of various T4 cDNA clones. 

Figures 7A and 7B depict the protein domain 
structure of purified* solubilized T4 protein and 
recombinant soluble T4 mutants. 
35 Figures 8A-8D are schematic outlines of 

constructions of various intermediate plasmids and 
other plasmids used to express recombinant soluble 
T4 ( M rsT4 M ) of this invention. 
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Figure 9A i-3 a schematic outline of the 
construction of plasmid pl99-7. 

Figures 9B and 9C are schematic outlines 
of the construction of plasmid p203-5. 
5 Figure 10 depicts the synthetic oligo- 

nucleotide linkers employed in various constructions 
according to this invention. 

Figure 11 depicts the nucleotide sequence 
of the entire plasmid defined by p!99-7 (F L mutet.rsT4) 
L0 and its rsT4-2 insert and the amino acid sequence 
deduced from the rsT4 sequence. This includes the 
Clal-Clal cassette which defines the Met perfect 
rsT4.2 coding sequence. 

Figure 12 depicts a protein blot analysis 
L5 of an induction of rsT4.2 expression from 
SG936/pl99-7. 

Figure 13 is a schematic outline of the 
construction of plasmid pBG368 . 

Figures 14A-14C are schematic outlines of 
20 constructions of various plasmids of this invention - 
Figure 15 depicts the nucleotide sequence 
of plasmid pBG391. 

Figure 16 depicts the nucleotide sequence 
of plasmid pBG392. In this figure, the T4 protein 
25 translation start <AA_ 23 ) is located at the methio- 
nine at nucleotides 1207-1209 and the mature 
N- terminus is located at the lysine (AA 3 ) at nucleo- 
tide 1281-84. 

Figure 17 is a schematic outline of con- 
30 structions of various plasmids of this invention. 

Figure 18 depicts the synthetic oligonucleo- 
tide linkers employed in various constructions 
according to this invention. 

Figure 19 depicts the nucleotide sequence 

35 of plasmid pBG394. 

Figure 20 depicts the nucleotide sequence 

of plasmid pBG396. 
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Figure 21 depicts the nucleotide sequence 
of plasmid pBG393. 

Figure 22 depicts the nucleotide sequence 
of plasmid pBG395. 
5 Figure 23 is a Coomassie stained gel of 

rsT4.2 purified from the conditioned medium of the 
pBG380 transfected CEO cell line BG380C of plasmid 
p!96-10. 

Figure 24 is a schematic outline of the 
10 construction of plasmid pl96-10. 

Figure 25 is a schematic outline of the 
construction of plasmid pBG394. 

Figure 26 is a schematic outline of the 
construction of plasmid p211-ll. 
15 Figure 27 is a schematic outline of the 

construction of plasmid p215-7. 

Figure 28 is a schematic outline of the 
construction of plasmid p218-8. 

Figure 29A is a Coomassie stained gel of 
20 rsT4. 113.1 purified from the conditioned medium of 
PBG211-11 transfected E.coli. 

Figure 29B is an autoradiograph depicting 
a Western blot analysis of rsT4. 113.1 expressed in 
E.coli . 

25 Figure 30, panels (a)-(c) depict the puri- 

fication of rsT4. 113.1 from E.coli trans formants . 

Figure 31. panels (a)-(c) depict the 
refolding of purified rsT4. 113.1. 

Figure 32 is an autoradiograph depicting 
30 the immunoprecipitation of 35 S-metabolically labelled 
CHO cell lines producing recombinant soluble T4. 

Figure 33 depicts an immunoblot analysis 
"of COS 7 cell lines producing recombinant soluble T4. 
Figure 34 depicts in graphic form the 
35 results of a competition assay between rsT4. 113.1 
and rsT4.3 for binding to OKT4A or OKT4. 
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Figures 35-37 depict in graphic form the 
results of competition assays ..etween rsT4.111 and 
rsT4.3 for binding to, respectively, OKT4A, Leu-3A 
and OKT4. 

5 Figure 38 depicts in graphic form an ELISA 

assay for rsT4. 113.1 from E.coli trans formants . 

Figure 39 depicts in graphic form the 
results of a p24 radioimmunoassay using recombinant 
soluble T4 according to this invention. 
10 Figures 40 and 41 depict the results of 

syncytia inhibition assays using recombinant soluble 
T4 proteins according to this invention. 

Figure 42 is a schematic outline of the 
construction of pi as mid pBiv.l. 
15 Figure 43 depicts the bivalent recombinant 

soluble T4 protein produced by pBiv.l. 

DETAILED DESCRIPTION OF THE INVENTION 

We isolated the DNA sequences of this 
invention from two libraries: a Agt cDNA library 

20 derived the T cell tumor line REX and a XgtlO cDNA 
library derived from peripheral blood lymphocytes. 
However, we could also have employed libraries pre* 
pared from other cells that express T4. These 
include, for example, H9 and U937. We also used a 

25 human genomic bank to isolate various fragments of 
the T4 gene. 

For screening these libraries , we used a 
series of chemically synthesized anti-sense oligo- 
nucleotide DNA probes based upon the T4 protein 

30 sequence set forth in Maddon et al. (1985), supra. 

For screening, we hybridized our oligo- 
nucleotide probes to our cDNA libraries utilizing a 
plaque hybridization screening assay. We selected 
clones hybridizing to several of our probes. And, 

35 after isolating and subcloning the cDNA inserts of 
the selected clones into plasmids, we determined 
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their nucleotide sequences and compared the amino 
acid sequences dedu :ed from those nucleotide sequences 
to the amino acid sequences referred to in Maddon 
et al. (1985), supra. As a result of these compari- 
5 sons, we determined that all of our selected clones 
were characterized by cDNA inserts coding for amino 
acid sequences of human T4. 

We have depicted in Figure 3 the nucleo- 
tide sequence of full-length T4 cDNA obtained from 

10 deposited clone p!70-2 and the amino acid sequence 
deduced therefrom. That cDNA sequence was subse- 
quently subjected to in vitro site-directed mutagen- 
esis and restriction fragment substitution so that 
its cDNA sequence was identical to that of Maddon 

15 et al. 

After modifying our T4 cDNA sequence to be 
identical to that of Maddon et al , . we truncated 
samples of it in various positions to remove the 
coding regions for the transmembrane and intracyto- 

20 plasmic domains. The remaining cOKA sequences encoded 
a soluble T4 which retained the extracellular region 
believed to be responsible for HIV binding. 

We then constructed various clones charac- 
terized by such cDNA inserts coding for human soluble 

25 T4. Those cDNA sequences may be used in a variety 
of ways in accordance with this invention. More 
particularly/ those sequences or portions of them, 
or synthetic or semi-synthetic copies of them, may 
be used as DNA probes to screen other human or animal 

30 cDNA or genomic libraries to select by hybridization 
other DNA sequences that are related to soluble T4. 
Typically, conventional hybridization conditions, 
"e.g., about 20° to 27°C below Tm, are employed in 
such selections. However, less stringent conditions 

35 may be necessary when the library is being screened 
with a probe from a different species than that from 
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which the library is derived, e.g.* the screening of 
a mouse library with a human pr. oe . 

Such cDNA inserts, portions of them, or 
synthetic or semi-synthetic copies of them, may also 
5 be used as starting materials to prepare various 

mutations. Such mutations may be either degenerate, 
i.e., the mutation does not change the amino acid 
sequence encoded by the mutated codon, or non- 
degenerate, i.e., the mutation changes the amine 

10 acid sequence encoded by the mutated codon. Both 

types of mutations may be advantageous in producing 
or using soluble T4's according to this invention. 
For example, these mutations may permit higher levels 
of production or easier purification of soluble T4 

15 or higher T4 activity. 

For all of these reasons, the DMA sequences 
of this invention are selected from the group con- 
sisting of: 

(a) the DNA inserts of pl99-7, pBC377, 

20 pBG380, pBG381, p203-5, pBG391, pBC392, pBG393, pBG394, 
PBG395, pBC396, pBG397, p211-ll, p214-10 and p215-7; 

(b) DMA sequences which hybridize to one 
or more of the foregoing DMA inserts and which code 
on expression for a soluble T4-like polypeptide; and 

25 (c) DMA sequences which code on expres- 

sion for a soluble T4-like polypeptide coded for on 
expression by any of the foregoing DMA inserts and 
sequences. 

Preferably, the DMA sequences of this 
30 invention code for a polypeptide selected from the 
group consisting of a polypeptide of the formula 
AA_ 23 -AA 362 of Figure 3, a polypeptide of the formula 
^1-362 of Fi 9 ure 3 » a polypeptide of the formula 
Met-AA 1 _ 362 °^ Figure 3, a polypeptide of the formula 
35 ^1-374 o£ Figure 3, a polypeptide of the formula 

Met-AA 1-374 of Figure 3, a polypeptide of the formula 
AA. of Figure 3, a polypeptide of the formula 
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Met-AA 1 _ 377 of Figure 3, a polypeptide of the formula 
AA_ 23 -AA 3?4 of Figure 3, a polyp ptide of the formula 
AA -23 -AA 3?7 of Figure 3, or portions thereof. 

DNA sequences according to this invention 
5 also preferably code for a polypeptide selected from 
the group consisting of a polypeptide of the formula 
AA_ 23 -AA 1Q2 of Figure 16, a polypeptide of the 
formula AA 1 -AA 182 of Figure 16, a polypeptide of 
the formula Met - AA i-ie2 of Fi 9ure 16, a polypeptide 

10 of the formula AA _ 2 3~ AA 182 of Fi ^ ure 16 • followed by 

the amino acids asparagine-leucine-glutamine-histidine- 
serine-leucine, a polypeptide of the formula 
AA^-AA 182 of Figure 16, followed by the amino acids 
asparagine-leucine-glut ami ne-histidinc-serine-leucine , 

15 a polypeptide of the formula Met-AA 1-182 of Figure 16, 
followed by the amino acids asparagine-leucine- 
glutamine-histidine-serine-leucine, a polypeptide of 
the formula AA_ 23 -AA 113 of Figure 16, a polypeptide 
of the formula AA 1 tAA 113 of Figure 16 , a polypeptide 

20 of the formula Met-AA 1-113 of Figure 16, a polypeptide 
of the formula AA_ 23 -AA 111 of Figure 16, a polypeptide 
of the formula AA 1 -AA 111 of Figure 16, a polypeptide 
of the formula Met-AA 1 _ 111 of Figure 16, a polypep- 
tide of the formula AA ^23" AA 131 of Fi 9ure 16, a poly- 

25 peptide of the formula AA^-AA^^ of Figure '16, a 

polypeptide of the formula Met-AA^_^ 31 of Figure 16, 
a polypeptide of the formula AA «23" AA 145 °^ F ^9 ure 
a polypeptide of the formula AA 1 -AA 145 of Figure 16, 
a polypeptide of the formula Met-AA^^^ of Figure 16, 

30 a polypeptide of the formula AA_ 23 ~AA 166 of Figure 16, 
a polypeptide of the formula AA 1 -AA 1&6 of Figure 16, 
* polypeptide of the formula Met-AA 1-166 of Figure 16, 
or portions thereof . 

Additionally, DNA sequences of this inven- 

35 tion code for a polypeptide selected from the group 

consisting of a polypeptide of the formula AA_ 23 «AA 362 

of mature T4 protein, a polypeptide of the formula 
3 6 6 7 



89085519 



WO 89/01940 



PCT/US88/02940 



-18- 

aa of mature T4 protein, a polypeptide of the 

formula Met-AA 1 _ 362 of mature protein, a polypep- 
tide of the formula AA 1-374 of mature T4 protein, a 
polypeptide of the formula Met-AA 1 _ 374 of mature T4 
5 protein, a polypeptide of the formula AA 1 _ 37? of 
mature T4 protein, a polypeptide of the formula 
Met-AA 1 _ 377 of mature T4 protein, a polypeptide of 
the formula AA^-AAg.^ of mature T4 protein, a poly- 
peptide of the formula AA^-AA^ of mature T4 pro- 
10 tein, or portions thereof. 

DMA sequences according to this invention 
also code for a polypeptide selected from the group 
consisting of a polypeptide of the formula AA _23~ AA 182 
of mature T4 protein, a polypeptide of the formula 
IS AA i- AA i82 of -ature T4 P roteia » a polypeptide of the 
formula Met-AA 1-182 of mature T4 protein, a polypep- 
tide of the formula AA _ 2 3" AA 182 of Mture T4 P* otein ' 
followed by the amino acids asparagine-leucine- 
glutamine-histidine-serine-leucine. a polypeptide of 
20 the formula AA 1 -AA 182 of mature T4 protein, followed 
by the amin o acids asparagine-leucine-glutamine- 
histidine-serine-leucine, a polypeptide of the 
formula Met- AA i-i82 of mature T4 Protein, followed 
by the amino acids asparagine-leucine-glutamine- 
25 histidine-serine-leucine. a polypeptide of the 

formula AA _ 2 3~ AA 113 of mature T4 Protein, a polypep- 
tide of the formula AA 1 ~AA 113 of mature T4 protein, 
a polypeptide of the formula Met-AA 1-113 of mature 
T4 protein, a polypeptide of the formula AA_ 23 -AA 111 
30 of mature T4 protein, a polypeptide of the formula 
- AA -AAm °* »eture T4 protein, a polypeptide of the 
' formula Met-AA 1 _ 111 of mature T4 protein, a polypep- 
tide of the formula AA_ 2 3~ AA l3l of mature T4 Protein, 
a polypeptide of the formula AA 1 -AA 131 of mature T4 
35 protein, a polypeptide of the formula Met-AA 1 _ 131 of 
mature T4 protein, a polypeptide of the formula 

AA -AA „ of mature T4 protein, a polypeptide of 
^^—23 145 
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the formula AA X -AA 145 of mature ^4 protein, a polypep- 
tide of the formula Met-AA 1-145 of nature T4 protein, 
a polypeptide of the formula ^-23*^166 of mature 
T4 protein, a polypeptide of the formula AA 1 ~AA^ 66 
5 of mature T4 protein, a polypeptide of the formula 

Met-AA 1-166 of mature T4 protein, or portions thereof. 

The amino terminal amino acid of mature 
T4 protein isolated from T cells begins at lysine, 
the third amino acid of the sequence depicted in 
10 Figure 16. Accordingly, soluble T4 proteins also 
include polypeptides of the formula AA^-AA^^ of 
Figure 16, or portions thereof. Such polypeptides 
include polypeptides selected from the group consist- 
ing of a polypeptide of the formula AA^ to AA 362 of 
15 Figure 16, a polypeptide of the formula AA^ to AA 374 
of Figure 16, a polypeptide of the formula AA 3 -AA 182 

of Figure 16, a polypeptide of the formula AA^-AA 145 

of Figure 16, and a polypeptide of the formula 
AA 3 -AA 111 of Figure 16. Soluble T4 proteins also 
include the above- recited polypeptides preceded by 
an N- terminal methionine group. 

25 Soluble T4 protein constructs according to 

this invention may also be produced by truncating 
the full length T4 protein sequence at various posi- 
tions to remove the coding regions for the transmem- 
brane and intracytoplasmic domains, while retaining 

30 the extracellular region believed to be responsible 
for HIV binding. More particularly, soluble T4 
polypeptides may be produced by conventional tech- 
niques of oligonucleotide directed mutagenesis; 
restriction digestion, followed by insertion of 

35 linkers; or chewing back full length T4 protein with 
enzymes . 



of Figure 16, a polypeptide of the formula A^-iw*^^ 

of Figure 16, a polypeptide of the formula AA^-AA^^ 

of Figure 16, a polypeptide of the formula AA^- 

20 of Figure 16, a polypeptide of the formula AA 3 -n« 166 
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Alternatively, soluble T4 polypeptides may 
be chemically synthesized by conventional peptide 
synthesis techniques, such as solid phase synthesis 
[ R. B. Merrifield, "Solid Phase Peptide Synthesis - 
5 I. The Synthesis Of A Tetrapeptide" , J. Am, Chem . 
Soc. , 83, pp. 2149-54 (1963)]. 

The DNA sequences of this invention code 
for soluble proteins and derivatives that are believed 
to bind to Major Histocompatibility Complex antigens 

10 and envelope glycoprotein of certain retroviruses, 

such as HIV. Preferably, they also inhibit syncytium 
formation, believed to be the mode of intracellular 
HIV virus spread. And, they may inhibit interaction 
between T4* lymphocytes and antigen-presenting cells 

15 and targets of T4* cell mediated killing. Most 

preferably, they also inhibit adhesion between T4* 
lymphocytes and infective agents, such as the HIV 
virus, whose primary targets are T4* lymphocytes . 

The DNA sequences of this invention are 

20 also useful for producing soluble T4 or its deriva- 
tives coded for on expression by them in unicellular 
hosts transformed with those DNA sequences. As well 
known in the. art, for expression of the DNA sequences 
of this invention, the DNA sequence should be opera- 

25 tively linked to an expression control sequence in 
an appropriate expression vector and employed in 
that expression vector to transform an appropriate 
unicellular host. 

Such operative linking of a DNA sequence 

30 of this invention to an expression control sequence, 
of course, includes the provision of a translation 
start signal in the correct reading frame upstream 
of the DNA sequence. If the particular DNA sequence 
of this invention being expressed does not begin 

35 with a methionine, the start signal will result in 
an additional amino acid — methionine — being 
located at the N- terminus of the product. While 
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such methionyl-contai^ing product- may be employed 
directly in 'the compositions and methods of this 
invention, it is usually more desirable to remove 
the methionine before use. Methods are available in 
5 the art to remove such N-tenninal methionines from 
polypeptides expressed with them. For example, 
certain hosts and fermentation conditions permit 
removal of substantially all of the N- terminal 
methionine in vivo . Other hosts require in vitro 
10 removal of the N- terminal methionine. However, such 
in vivo and in vitro methods are well known in the 
art. 

A wide variety of host/expression vector 
combinations may be employed in expressing the DNA 

15 sequences of this invention. Useful expression 
vectors, for example, may consist of segments of 
chromosomal, non-chromosomal and synthetic DNA 
sequences, such as various known derivatives of SV40 
and known bacterial plasmids, e.g. , plasmids from 

20 E.coli including col El, pCRl, pBR322, pMB9 and their 
derivatives, wider host range plasmids, e.g., RP4, 
phage DNAs, e.g., the numerous derivatives of phage X, 
e.g., NH989, and other DNA phages, e.g., M13 and 
filamenteous single stranded DNA phages, yeast plas- 

25 mids, such as the 2m plasmid or derivatives thereof, 
and vectors derived from combinations of plasmids 
and phage DNAs, such as plasmids which have been 
modified to employ phage DNA or other expression 
control sequences. For animal cell expression, we 

30 prefer to use plasmid pBG368, a derivative of pBG312 
(R. Cate et al., "Isolation Of The Bovine And Human 
Cenes For Mullerian Inhibiting Substance And 
Expression Of The Human Gene In Animal Cells", Cell , 
45, pp. 685-98 (1986)] which contains the major late 

35 promoter of adenovirus 2. 

In addition, any of a wide variety of 
expression control sequences — sequences that con- 
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trol the expression of a DNA sequence when opera- 
tic vely linked to it — may be us in these vectors 
to express the DNA sequence of this invention. Such 
useful expression control sequences, include, for 
5 example, the early and late promoters of SV40 or the 
adenovirus, the lac system, the trp system, the TAC 
or TRC system, the major operator and promoter regions 
of phage A, the control regions of fd coat protein, - 
the promoter for 3-phosphoglycerate kinase or other 

10 glycolytic enzymes, the promoters of acid phosphatase, 
e.g., PhoS, the promoters of the yeast a -mating 
factors, the polyhedron promoter of the baculo virus 
system and other sequences known to control the 
expression of genes of prokaryotic or eukaryotic 

15 cells or their viruses, and various combinations 

thereof. For animal cell expression, we prefer to 
use an expression control sequence derived from the 
major late promoter of adenovirus 2. 

A wide variety of unicellular host cells 

20 are also useful in expressing the DNA sequences of 
this invention. These hosts may include well known 
eukaryotic and prokaryotic hosts, such as strains of 
E . coli , Pseudomonas , Bacillus , Streptonyces , fungi, 
such as yeasts, and animal cells, such as CEO and 

25 mouse cells, African green monkey cells, such as 

COS 1, COS 7, BSC 1, BSC 40, and BMT 10, insect cells, 
and human cells and plant cells in tissue culture. 
For 'animal cell expression, we prefer CHO cells and 
COS 7 cells. 

30 It should of course be understood that not 

. all vectors and expression control sequences will 
function equally well to express the DNA sequences 
of this invention. Neither will all hosts function 
equally well with the same expression system. How- 

35 ever, one of skill in the art may make a selection 
among these vectors, expression control sequences, 
and hosts without undue experimentation and without 
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departing from the scope of this invention. For 
example, in selecting a vector, the host must be 
considered because the vector must replicate in it. 
The vector's copy number, the ability to control 
5 that copy number, and the expression of any other 
proteins encoded by the vector, such as antibiotic 
markers, should also be considered. 

In selecting an expression control sequence, 
a variety of factors should also be considered. 

10 These include, for example, the relative strength of 

the system, its controllability, and its compatibility 
with the particular DNA sequence of this invention, 
particularly as regards potential secondary struc- 
tures. Unicellular hosts should be selected by 

15 consideration of their compatibility with the chosen 
vector, the toxicity of the product coded for on 
expression by the DNA sequences of this invention to 
them, their secretion characteristics , their ability 
to fold proteins correctly, their fermentation re- 

20 quirements, and the ease of purification of the 

products coded on expression by the DNA sequences of 
this invention* 

Within these parameters, one of skill in 
the art may select various vector/expression control 

25 system/host combinations that will express the DNA 

sequences of this invention on fermentation or in large 
scale animal culture, e.g., CHO cells or COS 7 cells. 

The polypeptides produced on expression of 
the DNA sequences of this invention may be isolated 

30 from the fermentation or animal cell cultures and 
purified using any of a variety of conventional 
'methods. One of skill in the art may select the 
most appropriate isolation and purification tech- 
niques without departing from the scope of this 

35 invention. 

The polypeptides produced on expression of 
the DNA sequences of this invention are essentially 
3673 
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free of other proteins of human origin. Thus, they 
are different than T4 protein j. xrified from human 
lymphocytes. 

The polypeptides of this invention are 
5 useful in immunotherapeutic compositions and methods. 
For example, the polypeptides of this invention are 
active in inhibiting infection by agents whose primary 
targets are T4* lymphocytes by interfering with their 
interaction with those target lymphocytes. More 
10 preferably, the polypeptides of this invention may 

be employed to saturate the T4 receptor sites of T4- 
targeted infective agents. Thus, they exert anti- 
viral activity by competitive binding with cell 
surface T4 receptor sites. This effect is plainly 
15 of great utility in diseases, such as AIDS, ARC and 
HIV infection. Accordingly, the polypeptides and 
me th ods of this invention may be used to treat humans 
having AIDS, ARC, HIV infection or antibodies to 
HIV. In addition; these polypeptides and methods 
20 may be used for treating AIDS-like diseases caused 
by retroviruses, such as simian immunodeficiency 
viruses , in mammals , including humans . 

According to one embodiment of this inven- 
tion, antibodies to soluble T4 proteins and polypep- 
25 tides may be used in the treatment , prevention, or 
diagnosis of AIDS, ARC and HIV infection. 

The polypeptides of this invention may 
also be used in combination with other therapeutics 
used in the treatment of AIDS, ARC and HIV infection. 
30 For example, soluble T4 polypeptides may be used in 
combination with anti -retroviral agents that block 
"reverse transcriptase, such as AZT, HPA-23, phos- 
phono formate, suramin, ribavirin and dideoxyciti- 
dine. Additionally, these polypeptides may be used 
35 with anti -viral agents such as interferons, includ- 
ing alpha interferon, beta interferon and gamma 
interferon, or glucosidase inhibitors, such as 



3674 



'890*55 1Q 



-25- 

castanospennine. such combination therapies advan- 
tageously utilize lower dosages o those agents, 
thus avoiding possible toxicity. 

And, the polypeptides of this invention 
5 may be used in plasmapheresis techniques or in blood 
bags for selective removal of viral contaminants 
from blood. According to this embodiment of the 
invention, soluble T4 polypeptides may be coupled to 
a solid support, comprising, for example, plastic or 

10 glass beads, or a filter, which is incorporated into 
a plasmapheresis unit. 

Additionally, the compositions of this 
invention may be employed as immunosuppressants use- 
ful in preventing or treating graft-vs-host disease, 

15 autoimmune diseases and allograft rejection. 

The compositions of this invention typi- 
cally comprise an immunotherapeutic effective amount 
of a polypeptide of this invention and a pharmaceu- 
tical^ acceptable carrier. Therapeutic methods of 

20 this invention comprise the step of treating patients 
in a pharmaceutical^ acceptable manner with those 
compositions. 

The compositions of this invention for use 
in these therapies may be in a variety of forms. 

25 These include, for example, solid, semi-solid and 

liquid dosage forms, such as tablets, pills, powders, 
liquid solutions or suspensions, liposomes, supposi- 
tories, injectable and inf usable solutions. The 
preferred form depends on the intended mode of admin- 

30 istration and therapeutic application. The composi- 
tions also preferably include conventional pharma- 
ceutical^ acceptable carriers and adjuvants which 
are known to those of skill in the art. 

Generally, the pharmaceutical compositions 

35 of the present invention may be formulated and admin- 
istered using methods and compositions similar to 
those used for other pharmaceutically important poly- 
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peptides (e.g., alpha- interferon ) . Thus, the poly- 
peptides may be stored in lyoph. xized form, reconsti- 
tuted with sterile water just prior to administration, 
and administered by the usual routes of administration 
such as parenteral , subcutaneous , intravenous, intra- 
muscular or intralesional routes. An effective dosage 
may be in the range of from 0.5 to 5.0 mg/kg body 
weight/day, it being recognized that lower and higher 
doses may also be useful. 

This invention also relates to soluble 
receptors and their use in diagnosing or treating 
viral agents which target or bind to those receptors. 
Such soluble receptors may be used as decoys to 
absorb viral agents and to halt the spread of viral 
15 infection. Alternatively, virus-killing agents may 
be attached to the soluble protein receptors, 
providing a direct mode of delivery of those agents 
to the virus. 

More particularly, the polypeptides of 
20 this invention are useful in diagnostic compositions 
and methods to detect or monitor the course of HIV 
infection. Advantageously, these polypeptides are 
useful in diagnosing variants of the HIV virus, 
regardless of origin of the infecting HIV agent. 
25 For example, soluble T4 proteins and poly- 

peptides accor din g to this invention, which have a 
high affinity for HIV, may be advantageously used to 
increase the sensitivity of HIV assay systems now 
based upon monoclonal or polyclonal antibodies. 
30 More specifically, soluble T4 proteins and polypep- 
tides may be used to pretreat test plasma to concen- 
- xxate any HIV present, even in small amounts, so 
that it is more easily recognized by the antibody. 
And soluble T4 proteins and polypeptides may be used 
35 to purify the HIV envelope protein gp!20. 

Alternatively, the soluble T4 proteins and 
polypeptides of this invention may be used to replace 
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anti-HIV antibodies now used in /arious assays. 
These soluble T4 proteins and polypeptides are be 
preferable to anti-HIV antibodies for two reasons. 
First, soluble T4, exhibits an affinity for HIV of 
5 approximately 10~ 9 , a level which exceeds the 10" 7 
to 10~ fi values of anti-HIV antibodies. And, while 
anti-HIV antibodies are more likely to be specific 
for different HIV isolates, strain variations would 
not affect a soluble T4 protein-based assay, since 

10 all HIV isolates must be capable of interacting with 
the T4 receptor as a prerequisite to infectivity. 

For example, a soluble T4 protein or poly- 
peptide may be linked to an indicator, such as an 
enzyme, and used in an ELISA assay. Here, soluble 

15 T4 advantageously acts as a measure of both HIV in a 
test sample and any free HIV envelope gpl20 protein. 

And, polyvalent forms of soluble T4 prote? 
or polypeptides may be produced, for example, by 
chemi cal coupling or genetic fusion techniques, th u s 

20 increasing even further the avidity of soluble T4 
for HIV. 

In order that this invention may be better 
understood, the following examples are set forth. 
These examples are for purposes of illustration only, 
25 and are not to be construed as limiting the scope of 
the invention in any manner. 

EXAMPLES 

Purification Of Native Solubilized T4 

We purified native T4 from the T4*-promono- 
30 cytic cell line U937 derived from a histocytic 

lymphoma to approximately 50% purity usir.g immuno- 
affinity chromatography as follows. 

We grew U937 cells [a gift from Dr. Scott 
Hammer, New England Deaconess Hospital] to 
35 10 6 cells/ml in RPMI 1640, 10% FCS, harvested and 

washed them in IX PBS . We then lysed the cell pellet 
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in 20 mM Tris-HCl (pH 7.7), 0.5% NP-40 (a non-ionic 
detergent), 0.2% NaDOC, 0.2 mM EGTA, 0.2 mM PMSF and 
5 pg/ml BPTI at 4 x 10 7 cells/ml. Because this 
purification was carried out in the presence of a 
5 non-ionic detergent, T4, which is normally membrane- 
bound via its hydrophobic transmembrane domain, was 
isolated as a solubilized protein. We spun the lysate 
in a GS 3 rotor for 10 min at 10,000 rpm and stored 
the supernatant at -70°C. 

10 Subsequently, we preabsorbed the clarified 

cell extract with mouse IgG-Sepharose, followed by 
protein A Sepharose and then passed the flowtlirough 
through an immunoaffinity column comprising immobil- 
ized 19Thy anti-T4 monoclonal antibody on Affigel-10 

15 [a gift from Dr. Ellis Reinherz, Dana Farber Cancer 
Institute, Boston, Massachusetts]. We washed the 
column extensively and eluted the bound material 
with 50 mM glycine-HCl (pH 2.5), 0.15 M NaCl, 0.5% 
NP-40, 5 pg/ml BPTI and 0.2 mM EGTA. 

20 We then separated 10 pi aliguots of each 

elution fraction on a 10% SDS-PAGE under reducing 
conditions, with the bands being visualized by silver 
s taini ng. As shown in Figure 1, a major silver- 
stained band of 55 Kd was visible. We theji carried 

25 out two assays on the 55 Kd protein and sequenced 
the amino terminus of the protein to confirm its 
identity as native solubilized T4 . 

Sequencing Of Native Solubilized T4 

We determined the N-terminal amino acid 
30 sequence of our solubilized native T4 which we 

isolated from a detergent extract of U937 cells by 
immunoaffinity chromatography as described above. 

Techniques for determining the amino acid 
sequences of various proteins and peptides derived 
35 from them are well known in the art. We chose auto- 
mated Edman degradation to determine the amino 
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tenninus of our solul ilized native T4. More speci- 
fically, we gel purified and electrc -luted approxi- 
mately 5 M9 of the solubilized native T4 and then 
subjected it to automated Edman degradation using a 
5 gas phase sequencer (Applied Biosystems 470A). We 
then identified the FTH-amino acids produced at each 
cycle of the Edman chemistry by high pressure liquid 
chromatography, on-line with the sequencer, in a 
PTE- amino acid analyzer (Applied Biosystems 120A). 
10 Direct analysis of the protein provided amino terminal 
sequence information which, when compared to the 
amino acid sequence deduced from the cDNA sequence 
of human T4 f Haddon et al. (1985), supra], identified 
the purified protein as human T4. 

15 Radioig" "^ass ay Of Native Solubilized T4 

To determine that our purification process 
enriched for T4, we assayed fractions from the 
imxnunoaf finity elution step in a T4-specific sandwich 
radioimmunoassay, based upon the ELISA assay of P. E. 

20 Rao et al. , in Cellular Immunology , 80, pp. 310-19 
(1983). We coated each well of a Removawell strip 
(Dynatech Labs, Alexandria, Virginia) with 50 pi of 
10 Ml/ml OKT4 antibody (ATCC #CRL 8002) or MOPC195 
(a background binding control ). in 0 . 05 M sodium 

25 bicarbonate buffer (pH 9.4) at 4°C overnight. We 

washed the wells and then filled them with 1% FCS in 
PBS to saturate the protein binding capacity of the 
plastic. After removing the 1% FCS solution, we 
added test samples, in 50 pi aliquots, to the wells. 

30 We then incubated the samples for 4 hours at room 
temperature. Subsequently, we removed the samples 
and washed the wells four times with 0.05% Tween-20 
in PBS. We then added 125 I -labelled 19Thy antibody 
(50,000-100,000 cpm per well) and incubated the wells 

35 at 4°C overnight. We then washed the wells four 
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125 

rimes and separated e\ch well for bound I detection 
in a Beckman gamma detector. 

As shown in Figure 1, in which values were 
plotted following subtraction for background , the 
5 peak fraction of solubilized native T4 protein 

detected by radioimmunoassay coincided with elution 
of the 55 Kd protein seen by silver staining. 

Western Blot Assay For T4 

Although many antibodies have been developed 
10 for detecting T4 antigen, none are useful for protein 
blot analysis (Dr. Ellis Reinherz, personal communi- 
cation). In order to develop antibodies useful for 
Western blot detection of soluble T4 to follow the 
purification of T4 and recombinant soluble T4, we 
15 raised polyclonal, hyperimmune anti-T4 antisera in 
rabbits against three synthetic T4 oligopeptides. 
These oligopeptides are represented in Figure 3 as 
follows : 

Oligopeptide Amino Acid Coordinates 
20 JB-1 44-63 

JB-2 133-156 
JB-3 325-343 
We had previously synthesized these peptides using 
conventional phosphoamide DNA synthesis techniques. 
25 See, e.g.. Tetrahedron Letters , 22, pp. 1859-62 

(1981). We synthesized the peptides on an Applied 
Biosystems 380A DNA Synthesizer and purified them by 
gel electrophoresis. 

(i) Coupling Of T4 Peptides To BTG 

30 We coupled each of these peptides to the 

carrier protein bovine thyrogobulin ("BTG") [Sigma, 
St. Louis, Missouri] according to a modification of 
procedures set forth in J. Rothbard et al., J. Exp. 
Med. , 160, pp. 208-21 (1984) and R. C. Kennedy et al., 

35 "Antiserum To A Synthetic Peptide Recognizes The 
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HTLV-III Envelope Gl:"=oprotein" , Science , 231, 
pp. 1556-59 (1986). 

More specifically, we mixed 10 mg of BTG 
diluted in 1 ml of PBS with 1.3 mg of m-maleimido- 
5 benzoyl -N-hydroxysuccinimide ester ( M MBS M ) in 0.5 ml 
of dimethyl formamide ( M DMF W ). We mixed the reaction 
mixture well and reacted it for about 1 hour at 25 °C. 
Subsequently, we loaded the mixture onto a Sephadex 
G25 gel filtration column (Pharmacia, Sweden) w hich 
10 had been pre-equilibrated with C.l M PBS (pH 6.0). 
We then collected a total of thirty 2 ml aliquot 
elution fractions and read the absorbance of each 
fraction at 280 nm (-A 280 w ). we then pooled the 
three peak fractions (15, 16 and 17) to create the 
15 activated carrier. 

We dissolved 10 mg of NaBH 4 in 2.5 ml of 
0.1 M sodium borate solution to produce a sodium 
borohydride solution. Subsequently, we diluted 
approximately 8 ng of each of synthetic T4 peptides 
20 jb-1, JB-2 and JB-3 with 1 ml of 0.1 M borate buffer 

and then mixed each solution with 200 pi of the sodium 
borohydride solution, incubating the mixture on ice 
for 5 minutes. We then warmed each peptide solution 
to 25°C, brought each solution to pH 1.0 with IN. 
25 HCl (during which frothing occurred) and then brought 
each solution to pH 7.0 with 1 N NaOH ( after the 
fro thi ng had stopped). 

We then coupled each peptide to BTG by 
adding 1.2 ml of the peptide solution to 6 ml of the 
30 activated carrier solution. We allowed the coupling 
reaction to proceed overnight by incubating the reac- 
* tion mixture at room temperature . 

(ii) Inoculation Of Test Animals 

We dissolved each of the BTG-coupled pep- 
35 tides prepared above in sterile Freund's complete 

adjuvant, to a final concentration of 1 pg/ml coupled 
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peptide in PBS . subsequently, v~ inoculated each of 
tiiree rabbits (New Zealand white) by intramuscular 
injection of 500 pg of one of the coupled peptides 
into eaeh rabbit. We inoculated a fourth rabbit 
5 (New Zealand white) in the same manner with a mixture 
of the three coupled peptides. All rabbits were 
prebled prior to boosting to establish an average 
baseline for each response to be measured. The 
rabbits were boosted at 6 weeks with S00 pg coupled 
10 peptide in incomplete Freund's adjuvant. 

Serum was collected from each rabbit monthly 
for 4 months after immunization. The serum was then 
assayed for antipeptide titer. 

(iii) ELISA with Antipeptide Sera 
15 Against Peptide Coated Plates 

In this assay, we determined that antiserum 
raised in an animal by each of peptides JB-1, JB-2 
and JB-3 binds to that peptide. Accordingly, tbose 
peptides are immunogenic and elicit a response in 
20 test animals. 

To carry out the assay, we coated immulon-2 
(Dynatech Labs, Alexandria, Virginia) microti ter 
plates with 50 pi per well of 50 pg/ml uncoupled 
peptide in PBS and incubated the plates overnight at 
25 4°C. Plates coated with peptide 46R*, which served 

as controls, were treated identically. We then washed 
the plates 4 times with PBS-Tween (0.5%) and 4 times 
with water. The plates were blotted dry by gentle 
tapping over paper towels. After blotting the plates, 

30 " 

* Peptide 46 corresponds to amino acids ( "AA M ) 
728-751 of the env gene of the HIV genome. The amino 
acid numbering corresponds to that set forth for the 
env gene in L. Ratner et al., "Complete Nucleotide 
35 Sequence Of The AIDS Virus, Hn,V-III M , Nature , 313, 
pp. 277-84 (1985). Peptide 46 has the sequence: 
LP IP RCPDRPEG I EEEGGERDRDR . 
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we added 200 pi of a S% FCS/PBS solution to each 
well and incubated the plates fox 1 hour at room 
temperature . 

We then assayed serum samples from the 
5 rabbits on the pre-coated plates prepared as described 
above. We assayed the antibody response to the 
immunogen peptide at an initial dilution of 1:100, 
followed by serial 10- fold dilutions in 5% FCS/PBS. 

After a 2 hour incubation period at room 

10 temperature, we washed the plater: and blotted them 
dry as described above* We then added 50 pi of a 
1:1500 dilution of horseradish peroxidase ("HRP")- 
conjugated goat anti -rabbit- I gG [Cooper Biomedical, 
Malvern, Pennsylvania] in 5% FCS/PBS to each well 

15 and incubated the plates at room temperature for 

1 hour. We washed the plates with PBS-Tween 0.5%. 
We then added 50 m1 of 0.42 oM TMB. We stopped the 
enzyme reactions with 50 pi of 2 M H^C^. We then 
analyzed the plates spectrophotometrically at 450 run 

20 using a microti ter plate reader [Dynatech Labs, 
Alexandria, Virginia]. 

We observed that antiserum against each of 
peptides JB-1. JB-2 and JB-3 binds to the corre- 
sponding peptide. We also observed that antiserum 

25 against a mixture of peptides JB-1, JB-2 and JB-3 

binds to peptides JB-1 and JB-3 under the conditions 
set forth above. The titers of each of the four 
an riser a tested against the peptides in the solid- 
phase ELISA are shown below, where "ND" represents 

30 values not determined: 

Approximate Titer Against: 

Peptide JB-1 JB-2 JB-3 

jb-1 >l/50,000 0 ND 

JB-2 0 1/50,000 ND 

35 JB-3 0 0 1/10,000 

JB-1 ♦ JB-2 ♦ JB-3 1/4,000 ND 1/7,000 
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Ig fractions from two of the three anti- 
peptide sera raised against indi idual peptides, 
anti-JB-1 and anti-JB-2, recognized the 55 Kd T4 
antigen band of native solubilized T4 in a Western 
5 blot analysis of protein eluted from the 19Thy 
(anti-T4) monoclonal antibody affinity column 
described above. As in the case of the radioimmuno- 
assay of native solubilized T4, the detection of the 
55 Kd protein coincides with its apparent elution 

10 from the affinity column- This provides further 

evidence that our T4 purification procedure enriched 
for solubilized T4. 

Thus, these polyclonal sera are useful in 
the detection of nanogram quantities of T4 (both 

15 native and recombinant forms) by Western analysis. 

Binding of Cell-Free T4 To HIV Envelope 

We then tested our purified solubilized 
native T4 isolated from U937 cells for its ability 
to bind to the HIV envelope protein gpl60/gp!20. To 

20 carry out this direct binding assay, we incubated 
35 S-labelled gp!60/gpl20 detergent cell extract 
derived from a recombinant cell line 7d2 (a gift 
from Drs- Mark Kowalski and William Haseltine, Dana- 
Farber Cancer Institute) with samples of solubilized 

25 native T4, each of which had been preincubated with 
one type of monoclonal antibody. 

More specifically, we mixed 5 pi of solu- 
bilized T4 in a microfuge tube with 5 pg (about 3 pi) 
of OKT4 (ATCC 3CRL 8002), a monoclonal antibody 

30 recognizing an epitope on T4 which does not interfere 
with HIV binding [J. A. Hoxie et al., J. Immunol. , 
136, pp. 361-63 (1986)] or with 5 pg of OKT4A (Ortho 
Diagnostics #7142), a monoclonal antibody that inter- 
feres with HIV binding to T4 positive cells [J. Steven 

35 McDougal et al., J. Immunol. , 137, pp. 2937-2944 

(1986)]- Alternatively, we mixed 50 pi of solubilized 
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T4 with 5 pg of aHTLV III gpl20 (Dupont #NEN-9284). 

We then incubated the mixtures or* ice for 1 hour. 

35 

Subsequently, we added 150 pi of S- 
labelled gpl60/gpl20 cell extract or 35 S-labelled 
5 control cell extract (precleared with protein-A 

Sepharose) to the preincubated solubilized T4/mono- 
clonal antibody mixtures and rocked the tubes over- 
night at 4°C. We then precipitated the T4/gpl60/gpl20 
immune complexes by adding 30 pi of protein-A 

10 Sepharose to each tube and rocklrg for 2 hours at 

4°C to allow the protein-A Sepharose to bind to the 
antibody complexes. Subsequently, we spun down the 
beads in an Eppendorf mlcrofuge and after extensive 
washings, we eluted with 40 pi SDS sample buffer at 

15 65°C for 10 minutes. We then loaded 20 pi of the 

eluted material on a 7.5% SDS-PAGE gel which was run 
under reducing conditions. 

Figure 2 depicts autoradiograph and Western 
blot results of the T4/gpl60/gpl20 coimmunoprecipita- 

20 tions . In Figure 2, lanes 1-5 were autoradlographed 
after treatment with 40% sodium salicylate and lanes 
6-7 were developed on a Western blot with rabbit 
antlsera JB-2. 

As shown In Figure 2, gpl60/gpl20 protein 

25 was colmmunoprecipitated in the presence of T4 with 
OKT4 (lane 5) but not in the presence of T4 with 
OKT4A (lane 4). Lane 3 shows the positive control 
for gpl60/gpl20 using oBTLV III gp!20 monoclonal 
antibody. Neither negative control with 35 S-labelled 

30 control extract (lane 1) or protein-A Sepharose alone 
flane 2) showed bands migrating in the position of 
gpl60/gpl20. Based upon the bands that developed 
on the Western blot, the amount of T4 precipitated 
with either OKT4 (lane 6 ) or OKT4A (lane 7) appeared 

35 to be similar - 

This demonstrates that purified, solubilized 
native T4, which is naturally membrane bound, can 

3 6 8 5 
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still interact with the HIV glycoprotein in solution. 
Accordingly, we believe that cell free soluble T4 is 
useful in preventing the binding interaction between 
HIV and the T4 receptor of T4* lymphocytes- By com- 
peting with cell surface T4 for binding to the HIV 
envelope protein gpl20, soluble T4 is useful in block- 
ing HIV infection. 

Synthesis Of Oligonucleotide DKA Probes 

The nucleotide sequence and a deduced amino 
acid sequence for a cDNA that purportedly encodes 
the entire human T4 protein have been reported 
f Maddon et al. , (1985). supra). The deduced primary 
structure of the T4 protein reveals that it can be 
divided into domains as demonstrated below: 

Amino Acid 

Structure/Proposed Location Coordinates 
Hydrophobic/Secretory Signal 



10 



15 



-23 to -1 



Homology to V-Regions/ 

Extracellular ♦! tu> +94 

20 Homology to J-Regions/ 

Extracellular +95 to +109 

Glycosylated Region/ 

Extracellular +110 to +374 

Hydrophobic/Transmembrane 
25 Sequence 

Very Hydrophilic/ 

Intracytoplasmic +396 to +435 



+375 to +395 



Based on the sequence for the above- listed 
domains, we chemically synthesized antisense 

30 oligonucleotide DNA probes using conventional phos- 
phoami de DNA synthesis techniques. See, e.g.. 
Tetrahedron Letters , 22, pp. 1859-62 (1981). We 
synthesized the probes on an Applied Biosystems 380A 
DNA synthesizer and purified them by gel electro- 

35 phoresis. 
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Furthermore . we synthesized the probes 
such that they were complementary to the DNA 
sequences which code for the amino acid sequence, 
i.e., the probes were antisense, to enable them to 
5 recognize and hybridize to the corresponding sequences 
in DNA, as well as in mRNA. The nucleotide sequences 
of the eleven selected regions of the T4 protein 
[corresponding to the nucleotide numbering set forth 
in Maddon et al . , (1985), supra] were the following: 



10 


Olioonucleoti.de 


nucleotide 
Coordinates 




1 


145-171 




2 


742-765 




3 


1414-1440 


is 


6 


427-453 




7 


1303-1329 




8 


1012-1038 




9 


97-118 




10 


10-36 


20 


11 


1698-1724 




12 


397-423 




14 


261-287 



Before using our DNA probes for screening, 

we 5 > end-labelled each of the single-stranded DNA 

32 32 
25 probes with P using [ y- P]-ATP and T4 polynucleo- 
tide kinase, substantially as described by A.. M. Max am 
and W. Gilbert, "A New Method For Sequencing DNA", 
-Proc. Natl. Acad, Sci . USA , 74, pp. 560-64 (1977). 

Construction of AgtlO Peripheral Blood 
30 Lvmphocvtes cDNA Library 

To prepare our Peripheral Blood Lymphocytes 
(PBL) cDNA library, we processed PBL, from a single 
leukophoresis donor, through one round of absorption 
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to remove monocytes. We then stimulated the non- 
adherent cells with IFN-y 1000 Vml and 10 pg/»l PHA 
for 24 hours, we isolated RNA from these cells using 
phenol extraction [Maniatis et al.. Molecular Cloning , 
5 p. 187 (Cold Spring Harbor Laboratory) (1982)] and 

prepared poly A* mRNA by one round of oligo dT cellu- 
lose chromatography. We ethanol precipitated the 
RNA, dried it in a speed vac and resuspended the RNA 
in 10 pi HjO (0.5 pg/pl ) . we treated the RNA for 10 

10 min at room temperature in CH^EgOH (5 mM final con- 
centration) and p-mercaptoethanol (0.26 M) . We then 
added the methyl mercury treated RNA to 0.1 M Tris-HCl 
(pH 8.3) at 43°C, 0.01 M Mg, 0.01 M DTT, 2 mM Vanadyl 
complex, 5 M9 oligo dT 12 _ 18# 20 mM KC1. 1 mM dCTP, 

15 dCTP, dTTP, 0.5 mM dATP. 2 M Ci [o- 32 P)dATP and 30 U 

1.5 pi AMV reverse transcriptase (SeikagaJcu America) 
in a total volume of 50 pi. We incubated the mixture 
for 3 minutes at room temperature and then for 3 hours 
at 44 *C, after which time we stopped the reaction by 

20 the addition of 2-5 pi of 0.5 M EDTA. 

We extracted the reaction mixture with an 
equal volume of phenol: chloroform (1:1) and precipi- 
tated the aqueous layer two times with 0.2 volume of 
10 M NH^AC and 2.5 volumes EtOH and dried it under 

25 vacuum. The yield of cDNA was 1.5 pg. 

We synthesized the second strand according 
to the methods oif Okayama and Berg [ Mol . Cell . Biol . . 
2, p. 161 (1982)] and Gubler and Hoffman [ Gene , 25, 
pp. 263-69 (1983) J, except that we used the DNA poly- 

30 merase I large fragment in the synthesis. 

We blunt ended the double-stranded cDNA by 
resuspending the DNA in 80 pi TA buffer (0.033 M Tris 
Acetate (pH 7.8); 0.066 M KAcetate; 0.01 M MgAcetate; 
0.001M DTT; 50 pg/ml BSA ) , 5 pg RNase A, 4 units RNase 

35 H, 50 pM p NAD , 8 units E.coli ligase, 0.3125 mM 
dATP, dCTP, dGTP , and dTTP , 12 units T 4 polymerase 
and incubated the reaction mixture for 90 min at 
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37°C, added 1/20 volume of 0 . 5M EDTA, and extracted 
with phenol: chloroform. We chrcnatographed the 
aqueous layer on a C150 Sephadex column in 0.01M 
Tris-HCl (pH 7.5), 0.1 M NaCl, 0.001 M EDTA and 
5 collected the lead peak containing the double-stranded 
cONA and ethanol precipitated it. Yield: 0.605 pg 
CONA. 

We ligated the double-stranded cONA to 
linker 35/36: 

10 5 f AAXTCGAGCTCGAGCGCGGCCGC3 1 

3 • GCTCGAGCTCGCGCCGGCG5 » 

using standard procedures. We then size selected 
the cDHA for 800 bp and longer fragments on a S500 
Sephacryl column, and ligated it to EcoR I -digested 

15 bacteriophage lambda vector gtlO (a gift of 

Dr. Ellis Reinherz ) . We packaged aliquots of the 
ligation reaction in Gigapak (Strategene) according 
to the manufacturer's protocol. We used the packaged 
phage to infect E.coli BHN102 cells and plated the 

20 cells for amplification. The resulting library con- 
tained 1.125 x .10* independent recombinants. 

We also screened a PBL cDNA library in the 
bacteriophage lambda vector gtlO (a gift of Dr. Ellis 
Reinherz ) , which was synthesized from mRNA from a 

25 T4* tumor cell line named REX, which expresses T4 

protein at high levels [O. Acuto et al . , "The Human 
T Cell Receptor: Appearance In Ontogeny And 
Biochemical Relationship Of Lambda and Beta Sub units 
on IL-2 Dependent Clones And T Cell Tumors", Cell , 

30 34, pp. 717-26 (1983)]. 

Screening Of The Libraries 

We then used three of our 32 P-labelled 
synthetic oligonucleotide antisense probes, probes 3, 
6 and 9, to screen in parallel our two XgtlO cDNA 
35 libraries using the plague hybridization screening 

technique described in R. Cate et al . , "Isolation Of 
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The Bovine And Human Genes For Mullerian Inhibiting 
Substance And Expression Of The I am an Gene In Animal 
Cells", Cell , 45 , pp. 685-98 (1986), with minor 
modifications. We modified the Cate et al . proce- 
5 dure by hybridizing without tetramethyl ammonium 
chloride to accommodate our use of unique probes, 
rather than mixtures, to probe the plaque filters. 

We used the three probes, which had been 
previously 5* end-labelled with [*- 32 P]-AXP according 
10 to the method of A. Max am and W. Gilbert, Meth. 

Enzymol . , 68, pp. 499-80 (1979) to screen in parallel 
the PBL cDNA library and the REX cDNA library dis- 
cussed above. 

From our screening of the PBL library, we 
15 isolated a nearly full length soluble T4 cDNA clone — » 
X203-4 (or Agtl0.PBI.-T4) — containing a 3.064 kb 
insert which could be cleaved from the AgtlO vector 
with EcoR I . 

From our screening of the REX cell library, 

20 we isolated an incomplete T4 cDNA clone containing 

a 1,200 bp cDNA insert. We then further char acterized 
the DNA from these clones by DNA sequencing analysis. 

We also screened a bacteriophage lambda 
human genomic library, constructed in the vector 

25 EMBL3 by Dr. Mark Pasek (Biogen Inc., Cambridge, 

Massachusetts ) [N. Murray in Lambda 2, eds. R. Hendrix, 
J. Roberts, F. Stahl, R. Weisberg, pp. 3935-422 (1983) J. 
The library contains DNA fragments , created by partial 
restriction of chromosomal DNA from the human lympho- 

30 blastid cell line GM1416,48, XXXX (Human Genetic 
Mutant Cell Repository, Camden, New Jersey) with 
Sau 3a, ligated onto EMBL3 arms which had been sub- 
jected to cleavage with BamHI according to the pro- 
cedures outlined in Maniatis et al . . (1982), supra. 

3 5 Plating of the phage library, lysis, and transfer of 
the phage DNA onto nitrocellulose were performed as 
described by w. D . Benton and R. w. David, "Screening 
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of Lambda gt Recombinant Clones By Hybridization To 
Single Plaques In Sicu". Scienc e, 196. p. 180 (1977) 
and Maniatis et al. (1982). Hybridisation conditions 
were those described by Cate et al. (1986). supra. 
5 except that tetr ame thy 1 ammonium chloride (TMACl) was 
omitted from the washing buffer. 

Approximately 2 million plagues were 
screened in parallel hybridizations with probe 1 
and probe 3 discussed above. One phage, called 
10 CM47, which hybridized with probe 3 in the primary 
scre ening s, was subjected to DNA sequence analysis 
to determine the existence and position of an intron 
between the coding sequences for the predicted extra- 
cellular and transmembrane domains. No phage clones 
15 containing T4 sequences were found screening with 
probe 1. probably because it includes a sequence 
interrupted by an intron (D. R. Littman and S.N. 
Gettner. Nature . 325. pp. 453-55 (1987); and our 
observations ] - 
20 partial sequence analysis of CM47 shows 

that an intron interrupts the sequence corresponding 
to the codon for valine (amino acid 363) of the 
deduced primary sequence for T4 ( Figure 3 — in which 
introns are indicated by a solid line). This intron 
25 defines a potential site for introducing a stop codon 
in order to express a soluble form of T4- Another 
intron found within the coding sequence for T4 inter- 
rupts the codon for arginine (amino acid 295) and a 
third intron in CM47 is found between the codons for 
30 arginine (amino acid 402) and arginine (amino acid 
403) (Figure 3). 

Sequencing Of cDNA Clones 

We then subcloned EcoR I digested DNA from 
clone X203-4 into animal expression vector pBG312 
35 f R. Cate et al . . supra] to facilitate sequence 

analysis . More specifically, as depicted in Figure 4. 
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we then digested Agtl0 v PBL.T4 v: -h EcoRI to excise 
the 3-064 kbp Eco RI - Eco RI fragment containing the 
full length T4 cDKA. This cONA sequence, including 
the entire coding region for soluble T4 and for full 
5 length T4 was deposited in pl70-2. We used T4 ligase 
to ligate the fragment into animal expression vector 
pBG312 [supra] which had been previously cut with 
EcoR I , to form pBG312.T4 and p!70-2 (Figure 4). We 
then determined the nucleotide sequence of the EcoR I 
10 fragment of pBG312.T4 using Maxam Gilbert technology 
[A. M. Maxam and W. Gilbert, "A New Method For 
Sequencing DNA", Proc. Natl - Acad. Sci. USA , 74, 
pp. 560-64 (1977)) (see Figure 3, which depicts the 
PBL cDNA sequence in comparison to that reported by 
15 Maddon et al. t (1985), supra). This analysis showed 
that the 3.064 kbp PBL full length complementary DNA 
copy of T4 cDNA contained the coding sequence for 
T4, approximately 200 bp of 5' noncoding sequence 
and approximately 1500 bp of 3* noncoding sequence. 
20 We then cut pBG312.T4 with PstI and removed 

the resulting 3* protruding ends with Klenow and 
isolated an approximately 2.5 kbp fragment. We then 
inserted the fragment into the polylinker of pBG312 
(which had been previously restricted at the Sma l 
25 site) to form plasmid pl70-2, which contains the 
full length PBL T4 cDNA sequence (see Figure 3). 

As depicrted in Figure 3, the PBL T4 cDNA 
con tains - a nucleotide sequence almost identical to 
the approximately 1,700 bp sequence reported by 
30 Maddon et al. , (1985), supra. The PBL T4 cDNA, how- 
. ever, contains three nucleotide substitutions that, 
in the translation product of this cDNA, would pro- 
duce a protein containing three amino acid substi- 
tutions compared to the sequence reported by Maddon 
35 et al. As shown in Figure 3, these differences are 
at amino acid position 3. where the asparagine of 
Maddon et al . is replaced with lysine; position 64, 
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where the tryptophan of Maddon t al. is replaced 
with arginine and at position 231, where the phenyl- 
alanine of Maddon et al . is replaced with serine. 
The asparagine reported at position 3 of Maddon 
5 et al. instead of lysine was the result of a sequenc- 
ing error (Dr. Richard Axel, personal communication). 
The significance of the amino acid replacements at 
positions 64 and 231, which may represent alleliic 
polymorphism [T. C. Fuller et al.. Human Imm unology, 
10 9, pp. 89-102 (1982); W. Stohl and H. G. Kunkel, 

Scand. J . Immunol. , 20. pp. 273-78 (1984); N. Amino 
et al., Lancet , 2, pp. 94-95 (1984); and M. Sato 
et al., J. Immunol. , 132, pp. 1071-73 (1984)], is 
not known. 

15 DNA sequence analysis [Maxam and Gilbert , 

supra] of the insert in pEClOO of the REX clone sug- 
gests that it represents the product of a splicing 
error, because 5 • noncoding sequence appears to have 
been spliced with coding sequence beginning with the 

20 GGT codon for glycine (amino acid 49) (see Figure 3 
and Figure 5). The T4 coding sequence in pEClOO* 
from glycine (amino acid 49) to isoleucine (amino 
acid 435) is identical to the sequence of Maddon 
et al. , (1985), supra. 

25 In comparison, our earlier N- ter minal pro- 

tein sequence analysis of native T4 protein purified 
from* U937 cells shows a T4 expression product with 
aspargine as amino acid 3. These differences are 
also set forth in Figure 6, which also depicts com- 

30 parisons at corresponding positions of the partial 
clone from the REX cell line XgtlO library; our 



* We constructed pEClOO by digesting the incomplete 
T4 cDNA clone from the REX library with EcoRI and 
35 isolating the 1,200 bp cDNA insert. We then ligated 
it to pUC12 (Boehringer Mannheim. Indianapolis, 
Indiana) which had been previously cut with EcoRI to 
form pEClOO. 
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genomic clone from a XJMBL3 library; nouse T4 
sequences [Tourvieille et al.. Science. 234, p. 610 
(1986)] and sheep T4 sequences (Classon et al., 
Tmmunooenetics , 23, p. 129 (1986)]. 

5 construction of S oluble T4 Mutants 

we then employed the technique of in vitro 
site-directed mutagenesis and restriction fragment 
substitution to modify the T4 cDNA coding sequence 
of pl70-2 in sequential steps to be identical to 
10 that reported by Maddon et al. . (198S), supra. We 
first used oligonucleotide-directed mutagenesis to 
modify the amino acids at positions 3 and 64. Next, 
we employed restriction fragment substitution with a 
fragment including the serine 231 codon of a partial 
15 T4 cDNA isolated from a T4 positive lymphocyte cell 
line [O. Acuto et al.. Cell . 34. pp. 717-26 (1983)] 
library in Xgtll (a gift from Dr. Ellis Reinberz). 
to modify the amino acid at position 231. We then 
truncated our modified T4 cDNA sequence to remove 
20 the coding regions for the transmembrane and intra- 
cytpplasmic domains. Subsequently, we constructed 
three different soluble T4 mutants from our full 
length T4 clone PBL T4 by linker insertion between 
restriction sites in order to increase the probability 
2S of empirically finding a stable, secretable T4 mole- 
cule. The structure of each of these mutants is 
depicted in Figure 7A. 

Line A of Figure 7A represents a hydropathy 
analysis of our full length soluble T4 carried out 
30 using a computer program called Pepplot (University 
"of Wisconsin Genetics Computer Group) according to 
J. Kyte and R. F. Doolittle, J. Mol . Biol. , 157. 
pp. 105-32 (1982). Line B depicts the protein domain 
structure of full length T4 f Maddon e t al . , (1985) 
35 supra] in which M S W represents the secretory signal 
sequence. "V" represents the immunoglobulin-like 
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variable region sequence, "J" represents the immuno- 
globulin-like joining region sequence, M U H represents 
the unigue, extracellular region sequence, "TM M 
represents the transmembrane sequence and "C M repre- 
5 sents the cytoplasmic region sequence. In line B, 

the transmembrane amino acid sequence and some flank- 
ing sequence is written below the TM domain. Line C 
depicts the protein domain structure of recombinant 
soluble T4 mutants rsT4.1 in pBG377, rsT4.2 in pBG380 
10 and rsT4-3 in pBG381. Line D represents the protein 
domain structure of E.coli rsT4 gene (Met-perfect 
construct) (pl99-7) which is deleted for the T4 
N- terminal signal sequence (S). 

We constructed the first three soluble T4 
15 mutant gene fragments by truncating our full length 
soluble T4 cDNA at positions corresponding to either 
intron/exon boundaries or to protein domain boundaries 
defined by hydropathy analysis predictions. More 
specifically, we introduced synthetic linkers into 
the unique Ava l site that is 5' to the transmembrane/ 
extracellular domain boundary to produce an in- frame 
translational stop codon, thus constructing T4 genes 
that lack the transmembrane and cytoplasmic domains 
of the full length T4 sequence. 

For example, mutant rsT4.1 in pBG377 was 
truncated by the insertion of a stop codon following 
ami no acid 362, lysine, which corresponds to the 
position o£ an intron separating the extracellular 
and tr ans membrane domain exons. The positions both 
30 of thi s intron and of the adjacent intron that splits 
the transmembrane and cytoplasmic domains were deter- 
mined by DNA sequence analysis of chromosomal T4 
clones isolated from the \EMBL3 genomic library 
described above. Although the significance of the 
35 intron positions flanking the T4 transmembrane domain 
is hot known, the determination of the genetic struc- 
ture could provide important information for design - 
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ing rsT4 mutants, sir re exons frequently define 
functional domains [W. Gilbert, "Why Genes In Pieces?", 
Nature , 271, p. 501 (1978)]. 

we then constructed mutant rsT4.2 in pBG380 
5 by truncating the T4 cDNA at the boundary of the 
transmembrane and extracellular domains at amino 
acid 374. And, we constructed mutant rsT4.3 in 
pBG381 by truncating the T4 cDNA at amino acid 377, 
three amino acids downstream from the transmembrane/ 
10 extracellular domain boundary and within the trans- 
membrane domain . 

We also employed the technique of oligo- 
nucleotide site directed mutagenesis, according to 
D . strauss et al., "Active Site Of Triosephosphate 
15 Isomerase: In Vitro Mutagenesis And Characterization 
Of An Altered Enzyme", Proc. Natl . Acad. Sci- USA , 
82, pp. 2272-76 (1985), to construct a fourth soluble 
T4 mutant from our full length T4 clone PBL T4. The 
structure of this mutant is depicted in Figure 7A, 
20 line D, which represents the protein domain structure 
of E.coli rsT4 gene (Met-perfect rsT4.2) construct, 
deposited in pl99-7, which is deleted for the T4 
N- ter minal signal sequence ( S ) . 

We also constructed various other soluble 
25 T4 deletion mutants to determine which smaller 

fragments of the T4 sequence provide a protein which 
binds to HIV. These constructions were based on our 
belief that only the amino terminal sequence of T4 
is required for binding to HIV. This belief, in turn, 
30 was based upon observations that the monoclonal anti- 
body OKT4A blocks infection of T4 positive cells by 
HIV and it appears to recognize an epitope in the 
amino portion of T4 [ Fuller et al . , supra]. Such 
fragments of T4, which lack glycosylation and which 
35 are capable of binding HIV and blocking infection, 

may be produced in E.coli or chemically synthesized. 
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The structure of each of these deletion 
mutants is depicted in Figure 7B. In that figure, 
line A depicts the protein domain structure of full 
length T4 f Maddon et al. , (1985), supra; Figure 7A] ... 
5 In line B, the protein structure of recombinant 

soluble T4 mutants are depicted as follows: rsT4.7 
in p203-S, rsT4.7 in pBG392, rsT4.8 in pBG393, rsT4.9 
in pBG394, rsT4.10 in pBG395, rsT4.11 in pBG397, 
rsT4.12 in pBG396, rsT4.111 in pBG215-7, rsT4. 113.1 

10 in pBG211-ll and rsT4.113.2 in pBG214-10. 

We constructed soluble T4 derivatives 
p203-5, pBC392, pBG393, pBG394 and pBG396 by trun- 
cating our rsT4.2 gene after the StuI sites at amino 
acids 183 and 264 of rsT4.2. More specifically, we 

15 constructed derivative rsT4.7 in p203-5 and in pBG392 
by truncating the rsT4.2 cDNA at amino acid 182. 
And, we constructed each of derivatives rsT4.9 in 
pBG394 and rsT4 . 12 in pBG396 by truncating the rsT4.2 
cDNA at amino acids 113. and 166, respectively. One 

20 may also construct each of derivatives rsT4.10 in 
pBG395 and rsT4.11 in pBG397 by truncating the 
rsT4.2 cDKA at amino acids 131 and 145, respectively. 



Expression of T4 and Soluble T4 
Polypeptides In Bacterial Cells 

25 The cDNA sequences of this invention can 

be used to transform eukaryotic and prokaryotic host 
ce lls by techniques well known in the art to produce 
recombinant soluble T4 polypeptides in clinically 
and commercially useful amounts. 

30 For example , we constructed expression 

- vector pl99-7, as shown in Figure 9A, as follows. 

We preceded the construction depicted in 
Figure 9A by the construction of various intermediate 
plasmids, as depicted in Figures 8A-8D. Those con- 

35 structions were carried out using conventional 
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recombinant techniques. The li-.kers employed in 
those constructions are set forth in Figure 10. 

As depicted in Figures 8A and 8B, starting 
with pl70-2, which contains our full length T4 DNA 
5 sequence, coding for T4 characterized by three dif- 
ferent ami no acids than that of Maddon et al . , (1985), 
supra, we produced various constructs which direct 
the expression of soluble T4. Some of these con- 
structs are characterized in that one or more of 
10 those amino acid differences have been changed to 
corre spond to the respective amino acids of Maddon 
et al In thi s figure, as well as in the other 
figures, amino acid changes are reflected by an 
arrow. 

15 Plasmid pl92-6 contains the Met perfect 

rsT4.2 sequence derived by oligonucleotide site- 
directed mutagenesis which removed the entire T4 
N— terminal signal sequence as shown in Figure 8C 
And, to provide a convenient means of transferring 
20 the rsT4.2 Met perfect sequence into E.coli expression 
vectors, the steps described in Figure 8D were carried 
out to produce pl95-8, a plasmid containing the Met 
perfect rsT4.2 sequence flanked by Cla l restriction 
sites. The Cla l - Cla l cassette of pl95-8 optimizes 

25 the distance between the 5 • Cla l site and the 
initiating Met codon. In Figure 8D, ST8 rop~ 
is a tetracycline resistance encoding pAT153- 
based plasmid containing the rop~ mutation that 
permits high plasmid copy number, a promoter and 

30 ribosome binding site from bacteriophage gene 32 and 
the gene 32 transcription termination sequence. 

Cleavage of pl95-8 with Cla l produced the 
fragment used to assemble pl99-7, a construction 
which directs the expression of Met perfect rsT4.2 

35 under the control of the P L promoter (Figure 9A). 

As the first step, to construct a vector from which 
rsT4.2 expression is under control of the P L promoter. 
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we constructed the vector P197-1" from pl034 
(plmuGCSF) (Figure 9A). 

We then cut p!034 with EcoRI and BamHI to 
excise the GCSF cDNA insert and a portion of the 
5 phage mu ribosome binding site sequence — which we 
subsequently reconstructed with oligonucleotides. 
The synthetic linkers used were linkers 57-60 
(Figure 10). 

we then ligated the synthetic linker into 
10 the EcoR I /BamHI -cut pl034 to form pl97-12. One could, 
instead, replace these steps by starting with any 
suitable E.coli expression vector containing a Cla l 
site appropriately placed between the promoter and 
ter mina tor sequences. We cut pl97-12 with Cla l and 
15 inserted a Clal -Clal cassette containing the cDNA 

sequence of rsT4.3 in pBG381 and phage transcription 
ter min ator derived from p!034. The sequence of this 
cassette is depicted in Figure 11. The resulting 
plasmid, pl99-7. contains the rsT4.2 "Met perfect" 
20 gene in that vector. 

Alternatively, one could derive the Met 
perfect rsT4.2 sequence from plasmid pBG380, 
deposited in connection with this application, and 
gap out the signal sequence to create pl92-6. 
25 We tested for expression of p!99-7 as 

follows. SG936, an E.coli Ion htpr double mutant 
[AXCC 39624] [S. Goff and A. Goldberg, M ATP-Dependent 
Protein Degradation In E.coli " , in Maximizing Gene 
Expression , w. Reznifcoff and L. Gold (eds.) (1986)], 
30 was transformed with p!99-7 by conventional proce- 
dures f Maniatis et al, (1982)] to form SG936/pl99-7 , 
a trans f onnant containing a plasmid with the Met- 
perfect rsT4.2 gene behind the P L promoter. Trans - 
formants were selected on LB agar plates containing 
35 10 mcg/ml tetracycline (tet). After streaking out 

several single colonies for single colony isolation, 
one was chosen at random for testing induction of 
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rsT4.2 synthesis. We picked a single colony from an 
LB-agar tet**" plate into 20 ml L„ria Broth (LB) and 
10 mcg/ml tet in a 12S ml shake flask and grew it 
overnight in a shaking air incubator (New Brunswick 
Scientific. New Jersey) at 30°C. 

We then initiated an induction culture by 
adding 0.5 ml of the overnight culture to 50 ml LB 
and tet in a 500 ml flask which was grown at 30 °C in 
a shaking air incubator. When the culture reached 
an OD(600) of 0.4, we transferred it to a 42 °C water- 
bath and shook it gently for approximately 20 minutes. 
After heat induction at 42°C, the flask was trans- 
ferred to a 39°C air incubator (New Brunswick 
scientific. New Jersey) where it was shaken vigorously 
at 250 rpm. We withdrew samples just after the 42 °C 
heat shock, and at hourly time points for 4 hours, 
and then after overnight growth. The samples were 
measured for growth by OD(600) and analyzed following 
SDS-PAGE for the pattern of protein synthesis by 
Coomassie blue protein staining and by Western blot 
analysis with our rabbit an ti pep tide antibody probes 
(described above). Based on the relative molecular 
weight and protein blot analysis, the expression of 
rsT4.2 was induced from SC936/pl99-7 following heat 
induction at 42 °C (Figure 12). 

We transformed pl99-7 into a P L mu.tet 
expression vector, an E . coli expression vector, at 
the unique Clal site (see Figure 11). The nucleo- 
tide and amino acid sequences of pl99-7 are shown in 
Figure 11. 

The expression of soluble T4 from pl99-7 
-in E . coli was measured by western blot analysis of 
whole cell extracts following SDS-PAGE using the 
rabbit polyclonal anti-peptide JB-1 or anti-peptide 
JB-2 antibodies as probes (Figure 12). 

We also constructed expression vector 
p203-S, as shown in Figure 9B, as follows. 



3700 



89085519 



-51- 

We started with p!97-7, which has the same 
sequence as the P L um vector p!97-l- (see Figure 9A), 
except that there is a single nucleotide deletion in 
the 5 1 noncoding region following the P L promoter . 
5 That deletion, which is a deletion of nucleotide 

#40 — adenine — of pl97-12 (see Figure 11), resulted 
from a deletion in the region that was constructed 
from linkers 57-60 (see Figure 10). pl97-7 contains 
the rsT4.2 gene comprising 374 amino acids. Alter - 
10 natively, one could also use p!97-7 as a starting 
plasmid. 

we cut p!97-7 with Clal. We also cut pl95-8 
(see Figures 8D and 9A) with Cla l to remove the 
Cla l * Cla l cassette containing the cDNA sequence of 

IS rsT4 . 2 - Subsequently, we inserted the Clal -Clal 
cassette into p!97-7 to produce pl98-2. 

We then digested pl98-2 with StuI to 
remove 80 amino acids (amino acid 185 to amino acid 
264) of the mature T4 protein coding sequence. Unex- 

20 pected methylation, however, prevented cutting at 

the second Stu I site, so that only the Stu I site at 
amino acid 184 was cleaved. Following ligation, the 
plasmid DNA was transformed into E.coli and we 
examined several plasmid clones for the deletion 

25 using standard procedures. None of those pi asm i ds 
con tain ed the expected Stu I deletion. 

Subsequent DNA sequence analysis of one 
of these plasmids, called p203-5, showed that two 
guanine residues (see amino acids 183 and 184; 

30 nucleotides 818 and 819. of Figure 3) of the Stu I 
recognition sequence had been deleted following 
cleavage due to exonuclease digestion caused by the 
use of exonuclease-contaminated Stu I enzyme. This 
dinucleotide deletion produced a translation frame- 

35 shift following amino acid 182 (glutamine) and intro- 
duced a stop codon six amino acid codons downstream 
from the frameshift (Figure 9C). The unexpected 
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methylation of the second Stul site together with 
the deletion that resulted in a »ev stop codon 
produced a gene encoding a shortened form of recom- 
binant soluble T4, called rsT4.7. The rsT4.7 sequence 
5 encodes a 182 amino acid N-terminal segment of the 
mature T4 sequence followed by, at the C-terminus, 
six amino acids — asparagine-leucine-glutamine- 
histirti ne- serine- leucine — of non-T4 sequence and 
finally by a TAA stop codon. 
10 The expression of soluble T4 from p203-5 

in E.coli was measured by Western blot analysis 
as previously described. 

Expression of T4 and Soluble T4 
Polypeptides In Animal Cells 

15 We inserted both soluble T4 genes and the 

unmodified gene encoding membrane-bound T4 into 
animal expression vector pBG368. More specifically, 
we inserted each of the soluble gene constructs into 
pBG368 under the transcriptional control of the 

20 adenovirus late promoter, to give plasmids pBG377, 
pBG380 and pBG381. We also made two pBG3 12 -based 
constructions, called pBG378 and pBG379, which 
direct the expression of recombinant full length T4 
protein. pBG378 and pBG379 code for the same full 

25 length T4 protein but in pBG379, a portion of the 3 • 

untranslated sequence has been removed. Subsequently, 
to test for expression of recombinant soluble T4 and 
recombinant full length T4, we cotransf ected Chinese 
hamster ovary ("CHO") cells with one of each of 

30 those plasmids and with the plasmid pAdD26. 

We first constructed pBG3 6 8 as follows. 
As depxcted in Figure 13 , we cut animal cell expres- 
sion vector pBG312 [R. Cate et al., "Isolation Of 
The Bovine And Human Genes For Mullerian Inhibiting 

35 Substance And Expression Of The Human Gene In Animal 
Cells", Cell , 45 , pp. 685-98 (1986)] with Eco RI and 
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Bglll to delete one of each of f.e two EcoRI and the 
two Bglll restriction sites (the Eco>il site at posi- 
tion 0 and the Bglll site located at approximately 
position 99). The resulting plasmid. pBG368. retained 
5 an EcoRI site in the cloning region and a Bglll site 
after the cloning region. This left a single EcoRI 
site and a single Bglll site in the polylinxer for 
cloning purposes. 

More specifically, we deleted one EcoRI 

10 site and one Boll I site by sequential partial diges- 
tion of pBG312 with restriction enzymes EcoR I and 
Bglll. respectively. We filled in with Klenow and 4 
nucleotides then religated to produce pBG368, which 
contains unique restriction sites for EcoRI and Bglll 

15 enzymes - 

Once transient expression of soluble T4 
was verified, we constructed stable cell lines that 
continuously expressed soluble T4. To do this, we 
employed the stable cell expression host, the 
20 dihydro folate reductase deletion mutant (DHFR~) 
Chinese hamster ovary cell line [F. Kao et al., 
•Genetics Of Somatic Mammalian Cells X Complementation 
Analysis of Glycine-Requiring Mutants", Proc. Natl. 
Acad. Sci. . 64, pp. 1284-91 (1969); L. Chasin and 
25 G- Uriah "Isolation Of Chinese Hamster Cell Mutants 
Defici ent In Dihydrbfolate Reductase Activity", 
PrBC . Matl. Acad. Sci. , 77. pp. 4216-80 (1980)]. 

Using this system, we cotransfected each 
T4 gene construct with pAdD26 [R. J. Kaufman and 
30 P. A. Sharp, "Amplification And Expression Of 

Sequences Cotransfected with a Modular Dihydro folate 
Reductase Complementary DNA Gene" , J . Mol. Biol . . 
159, pp. 661-21 (1982) containing the mouse DHFR 
gene. Before carrying out the co-trans fections , we 
35 linearized all plasmids by restriction enzyme cleavage 
and, prior to trans fecti on, we mixed each plasmid 
with pAdD26 so that the molar ratio of pAdD26 to T4 
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was 1:10. This maximized the number of T4 gene copies 
per trans fectant. 

Within the cell, the plasmids were ligated 
together to form polymers that can become integrated 
5 into host chromosomal sequences by illegitimate 

recombination [J. Haynes and C. Weissmann, "Constitu- 
tive, Long-Term Production Of Human Interferons By 
Hamster Cells Containing Multiple Copies Of a Cloned 
Interferon Gene", Nucl. Acids Res, , 11, pp. 687-706 
10 (1983); S. J. Scahill et al., "Expression And Charac- 
terization Of The Product Of A Human Immune Interferon 
cDNA Gene In Chinese Hamster Ovary Cells", Proc. Natl. 
Acad. Sci. USA , 80, pp. 4654-58 (1983)). We selected 
trans fee tants that express the mouse DHFR gene in 
15 culture medium lacking nucleotides. We then subjected 
these transfectants to a series of increasing concen- 
trations of methotrexate, a toxic folate analogue 
that binds DHFR, to select for cells levels of DHFR. 

Resistance to methotrexate by increased 
20 expression of DHFR is frequently the result of DHFR 

gene amplification, which can include the reiteration 
of large chromosomal segments, called amplified 
units [R. J. Kaufman and P. A. Sharp, "Amplification 
And Expression Of Loss Of Dihydrofolate Reductase 
25 Genes In A Chinese Hamster Ovary Cell Line", Molec, 
Cell. Biol. , 1, pp. 1069-76 (1981)]. Therefore, 
cointegration of DHFR and rsT4 sequences permitted 
the amplification of rsT4 genes. Stably transfected 
cell lines were isolated by cloning in selective 
30 growth medium, then screened for T4 expression with 
-a T4 antigen (RIA) [D. Klatzmann et al.. Nature, 
312, pp. 767-68 (1984)) and by immunoprecipitation 
from conditioned medium after [ 35 S] cysteine 
(" 35 S-Cys" ) metabolic labelling. 
35 we also inserted the soluble T4 derivative 

rsT4.7 gene into an animal cell expression plasmid 
as follows- 
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As set forth in Figure 14C, we cut plasmid 
pBG381 (Figure 14A) with EcoRI a:.d Nhel . We then cut 
pl86-6 with EcoR I and Nhe l to remove the 786 base 
pair fragment. We ligated that fragment into the 
5 digested pBG381 to form plasmid pBG391. The T4 
sequence in pBG391 is identical to both that of 
Maddon et al. (1985) supra at positions 64 (tryptophan) 
and 231 (phenylalanine) and to that of pBG381. How* 
ever, at position 3, the asparagine reported by 

10 Maddon et al, and present in pBG381 is replaced with 

lysine. The nucleotide sequence of pBG391 is depicted 
in Figure 15. 

We then digested p203-5 with Nhel and 
Ox an I to remove the 483 base pair fragment. We 

15 inserted that fragment into Nhe I /OxanI -digested 
PBG391 to form plasmid pBG392, the animal cell 
expression construct of rsT4.7. The T4 sequence in 
rsT4.7 contains amino acids identical to that of 
Maddon et al . 1 s full length sequence at amino acid 

20 positions 64 (tryptophan) and 231 (phenylalanine). 
However, at position 3, the asparagine reported by 
Maddon et al . is replaced with lysine. The nucleo- 
tide sequence of pBG392 is depicted in Figure 16. 

In Figure 14D, we have depicted the con- 

25 struction of other animal cell expression constructs 
containing sequences encoding the deletions rsT4.9 
in pBC394, and rsT4.12 in pBG396. Those constructions 
were carried out using conventional recombinant tech- 
niques. The linkers employed in those constructions 

30 are set forth in Figure 18. The nucleotide sequences 
of pBG394 and pBG396 are shown in Figures 19 and 20. 

Plasmid pBG393, shown in Figure 17, con- 
tains rsT4.8, the perfect form of rsT4.7. pBG393 
contains 182 amino acids of the mature T4 sequence, 

35 without the additional non-T4 6 amino acids at the 

C-terminus following amino acid 182. The nucleotide 

2 -7 q gseguence of BG393 is shown in Figure 21. 
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Other animal cell expression plasmids 
according to this invention may oe constructed as 
depicted in Figure 17. These include rsT4.10 in 
pBG395 and rsT4.11 in pBG397 (see Figure 18 for 
5 specific linkers ) . 

The nucleotide sequence of BG395 is shown 
in Figure 22. 

Purification Of Recombinant Soluble T4 

Recombinant soluble T4 construct pBG380 

10 expressed in DHFR~ CHO cells was grown to confluency 
in a a -Modified Eagles Medium (Gibco) supplemented 
with 102 fetal calf serum, 1 mM glutamine and the 
antibiotics penicillin and streptomycin (100 pg/ml 
of each). The cells were grown at 37°C in two 21 Cell 

15 Factory Systems (Nunc). We then washed the confluent 
cells free of fetal calf serum with a -Modified Eagles 
Medium without fetal calf serum and cultured the 
cells in o -Modified Eagles Medium at 37°C for 4 days. 
Subsequently , we harvested the conditioned media, 

20 filtered it through a Millipore Minidisk 0.22p 

hydrophilic filter cartridge (Millipore #MCGL 305-01) 
and concentrated the secreted proteins on a fast-S 
ion exchange column (S-Sepharose Fast Flow, Pharmacia 
#17-0511-01) in 20 mM MES buffer (pH 5.5). 

25 We then eluted the bound proteins with 20 mM 

Tris-HCl (pH 7.7) and 0.3 M NaCl. The elution pool 
was subsequently diluted with 2 volumes of 20 mM 
Tris-HCl (pH 7.7) and it was then loaded on a column 
comprising immobilized 19Thy anti-T4 monoclonal anti- 

30 body coupled to Affigel-10 [a gift of Dr. Ellis 
"Reinherz, Dana Farber Cancer Institute , Boston, 
Massachusetts]. We washed the column extensively 
and eluted the bound material as 0.5 ml fractions 
with 50 mM glycine-HCl (pH 2.5), 150 mM NaCl, 0.1 mM 

35 EGTA and 5 M9/ml bovine pancreatic trypsin inhibitor, 
Aprotinin (Sigma 3A1153 ) . We used Western blots 
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developed with' rabbit antisera raised against peptide 
JB-2 to follow the purification. We employed silver 
stained gels to follow binding and elution of rsT4.2 
during the chromatography. Figure 23 depicts a 
5 Coomassie stained gel of purified rsT4.2. 

Gel sizing-column chromatography analysis 
of the purified rsT4. 2 from the pBG380 transfected 
CHO cell line, BG380G, suggests that rsT4 is monmeric 
under physiologic pH and salt concentration. 

10 Sequencing Of Recombinant 
Soluble T4 Protein 

We then determined the N- terminal amino 
acid sequence of a recombinant soluble T4, specifi- 
cally rsT4.2, molecule purified from the conditioned 

15 medium of the pBG380 transfected CHO cell line BCSOC, 
as described above, by automated Edman degradation 
in an Applied Biosystems 470A gas phase sequenator 
[R. B. Pepinsky et al., J. Biol Chem. , 261, 
pp. 4239-46 (1986)). 

20 The amino terminal sequence matched the 

sequence which we had previously determined for 
solubilized native T4 isolated from U937 cells, supra . 
The amino terminal sequences of native solubilized 
T4 (sT4) and purified rsT4 protein are A2 proteins, 

25 as compared to the amino terminal sequence predicted 

by Maddon et al., (1985), supra, with the mature amino 
terminus located at position 3 of that sequence. The 
amino terminal sequences of solubilized native T4 
(sT4), recombinant soluble T4 (rsT4.2) secreted by 

30 CHO trans fectant BG380G containing pBG380 and the 
protein sequence deduced by Maddon et al . (1985), 
supra are as follows: 

ST4 : X-K-V-V-L-X-K-K-X-D-T-V-E-L-T-X-T-A-S-E- 
rsT4 . 2 : N-K-V-V-L-G-K-K-G-D-T-V-E-L-T-X-T-A-S-E- 
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Maddon 

et a i. q-g-N-K-V -V-L-G-K-K-r.-D-T-V-E-L-T-C-T-A-S-E 

In the above sequences, the amino acids 
are represented by single letter codes as follows: 



Phe: 


F 


Leu: 


L 


lie: 


I 


Met: 


M 


Val: 


V 


Ser : 


S 


Pro: 


P 


Thr : 


T 


Ala: 


A 


Tyr : 


y 


Bis: 


B 


Gin: 


Q 


Asn: 


N 


Lys : 


K 


Asp : 


D 


Glu: 


E 


Cys: 


C 


Trp: 


W 


Arg: 


R 


Gly: 


G 



X0 X: not: determined or ambiguous. 

we also constructed pBG211-ll, a plasmid 
coding for the N- terminal 113 amino acids of soluble 
T4 protein. This construct, which codes for a pro- 
tein characterized by a single disulfide bridge, 
15 between the cysteines at amino acid positions 18 and 
86, is conveniently expressed in E.coli. 

To construct p211-ll, as depicted in Fig- 
ure 24, we first cut pl95-8 (see Figures 8D and 9A) 
with Clal to remove the Cla l- Cla l cassette contain- 
20 ing the cDNA sequence of rsT4.2. We then digested 
pAX153?3SHl&AAmp, the tryptophan operon promoter 
pl asmi d from the gamma interferon producing E.coli 
strain BN374 with Cla l , and deleted the cDNA coding 
for gamma interferon. Subsequently, we inserted 
25 the Clal -Clal cassette into the Clal -cut E.coli 

plasmid in front of the tryptophan operon promoter 
and' ligated to produce pl96-10. 

As shown in Figure 25, we then subjected 
pBG380 to oligonucleotide-directed mutagenesis to 
30 insert thr ee tandem translational stop codons follow* 
- ing the T4 cDNA sequence coding for amino acids -23 
to 113 in pBG380, co produce pBG394. 

We then constructed p211-ll from fragments 
of each of pl96-10, pBG394 and p!034 as depicted in 
35 Figure 26. The first fragment including the vector 
sequences, was produced by restricting p!96-10 with 
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Hind i 1 1 and Clal to remove the 14 coding sequence 
from amino acids 61 through 374 of rsT4.2 and includ- 
ing vector sequence following the 3 1 end of the rsT4 
gene. The second fragment, a Hind i I I - Bglll segment 
5 including the codons for T4 amino acids 61-113 of 
rsT4.9 immediately followed by a triplet of stop 
codo ns in tandem, was isolated by Hind i I I /Bgl l I diges- 
tion of pBG394. The third fragment, a BamH I - Cla l 
frag men t containing a bacteriophage T4 transcriptional 

10 termination signal [H. N. Kirsch and B. Allet, 

"Nucleotide Sequences Involved In Bacteriophage T4 
Gene 32 Translational Self-Regulation", Proc. Natl . 
Acad. Sci. USA , 79, pp. 4937-41 (1982)], was isolated 
by BamH I /Cla l digestion of pl034. We then ligated 

15 these three fragments to produce p211-ll, a T4 con- 
struct coding for a 113 amino acid soluble form of 
T4 protein, with asparagine at amino acid position 3 
(i.e., rsT4. 113.1). 

We then subjected p211-ll to oligonucleo- 

20 tide site-directed mutagenesis (Figure 27) to change 
the amino acid at position 3 from asparagine to 
lysine using the oligonucleotide T4-66: 

•t j 

5' AIG CAG GGT AAA 

" I I 

AAA GTA GTA CTG 

GGC 3* . 

This produced plasmid p214-10, a fully 
corrected 113 amino acid soluble T4 vector coding 
30 Tor a 113 amino acid soluble form of T4 protein, 
with lysine at amino acid position 3 (i.e., 
rsT4.113.2). As shown in Figure 27, we subjected 
p214-10 to oligonucleotide site-directed mutagene- 
sis to delete glutamine and glycine at, respectively, 
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amino acid positions 1 and 2 of the T4 sequence using 
the oligonucleotide T4AID-87: 

C 



i 



5 • GTA TCC ATT TGG 
5 ATG ATG AAA AAA 

GTA GTA 3 » . 

This produced p215-7, a 111 amino acid 
soluble T4 construct, including the trp promoter, 
which directs the expression of a 111 amino acid 
10 soluble form of T4 protein, with lysine at amino 
acid position 3 (i.e., rsT4.111). 

we next constructed p218-8, a 111 amino 
acid construct which directs the expression of a 111 
amino acid soluble form of T4 protein, with lysine 
15 at amino acid position 3 (i-e., rsT4.111) under the 

control of the promoter, as depicted in Figure 28. 

More specifically, we cut p!97-12 (Figure 
9A) with Cla l to remove the 101 bp fra gmen t contain- 
ing linker and terminator sequences. We also cut 
20 p215-7 with Cla l to remove the Cla l - Cla l cassette 
con taining the cDHA sequence of rsT4.111 and the #T4 
transcriptional terminator sequence f Kirsch and Allet , 
supra] . Subsequently, we inserted the Cla l • Cla l 
cassette into the Cla l -cut pl97-12 to produce p218-8. 
25 In order to express rsT4. 113.1, we trans- 

formed E.coli A89 with p211-ll by conventional 
techniques [Maniatis et al. (1982), supra] to form 
E.coli A89/p211-ll. E.coli A89 is a tetracycline 
sensitive derivative of E.coli SG936. We isolated 
30 E.coli A89 from E.coli SG936 according to the method 
of S- R. Maloy and W. D. Nunn, ,f Selection For Loss 
Of Tetracycline Resistance By Escherichia coli " , 
J. Bact. , 145, pp. 110-12 (1981), which is based 
upon the ability of the lipophilic chelating agent 
35 fusaric acid to selectively inhibit resistant strains. 
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More specifically, plated E^coli SG936 on medium 
containing, per liter, 5 g tryptone. 5 g yeast extract, 
10 g NaCl, 10 g NaH 2 P0 4 -H 2 0, 50 mg chlortetracycline- 
HC1, 12 mg fusaric acid, 0.1 mM 2nCl 2 and 15 g agar. 
5 Colonies which grew at 30 °C (putative tetracycline- 
sensitive strains) were retested for tetracycline 
sensitivity on L-agar plates containing 5 »g/ml 
tetracycline. One tetracycline- sensitive strain, 
designated A89, was then shown to be unable to grow 
10 on LB agar at 42 °C. thus verifying the presence of 
the htpR mutation. 

Trans formants were selected by tetracycline 
resistance, we picked a single colony into 20 ml of 
minimal medium plus 0.2% cas amino acids plus trypto- 
15 phan (100 pg/ml) plus tetracycline (10 pg/ml) in a 

100 ml shaJce flask placed in a shaking air incubator 
at 30°C and allowed the cells to grow up overnight. 
The following morning, we inoculated 40 ml of mini- 
mal medium plus 0.2% cas amino acids plus tryptophan 
20 (100 pg/ml) plus tetracycline (10 pg/ml) with the 

overnight culture at OD 6Q0 = 0.05 in a 500 ml flask. 
The cells were grown to midlog phase and th en induced 
by pelleting, washing once in minimal medium and 
then resusp ending in minimal medium plus 0.2% cas- 
25 ami no acids plus tetracycline (10 pg/ml), in the 
absence of tryptophan. We removed 0.6 OD 6Q0 of 
cells after 0, 1, 2, 3 and 4 hours incubation and 
after growth overnight. 

The aliguots were centrifuged and cell 
30 pellets were subjected to lysis by boiling in 

Laemmli gel loading buffer. After centrifugation to 
remove cell debris, half o^ each sample was subjected 
to SDS-PAGE, followed by Western blot analysis with 
our rabbit antipeptide antibody probes or by Coomassie 
35 blue protein staining (Figures 29A and 29B). 
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Purification Of rsT4. 113.1 

We then purified rsT4. 113.1 from the E.coli 
trans formant by means of two essentially quantitative 
steps involving ani on-exchange and gel- filtration 

5 chromatographies performed under reducing and dena- 
turing conditions • 

More specifically, we suspended 14 g of 
we t: cells from a 4 L shake-flask fermentation in 
100 ml of a 20mM Tris (pH 7.S) buffer containing 

10 20 vg/ml DNase, 20 pg/ml RNase and 1 mM phenylmethyl- 
sulfonylfluoride ( "PMSF" ) . The suspension was applied 
to a French Press at 1000 psi in two passages and 
then centrifuged in an SA 600 rotor at 18,000 g for 
15 min at 4*C. The resulting pellet was solubilized 

15 in 20 ml of a 20 mM Tris (pH 7.5) buffer conta inin g 
7 M urea and 10 mM 2-mercaptoethanol. We then sub- 
jected the suspension to ultracentrifugation at 
85,000 g for 90 min at 4°C. The supernatant was 
diluted by the addition of 80 ml of 20 mM Tris 

20 (pH 7.5) buffer containing 7 M urea and 10 mM 
2-mercaptoethanol and 40 ml of the sample was 
applied to a 3 x 4 cm Q-Sepharose fast-flow column 
(Sigma, St. Louis, Missouri) which had been pre- 
equilibrated in the same buffer. The column was 

25 developed with a gradient in 400 ml total volume of 

increas ing Nad from 0 to 0.3 M in the same Tris/urea/ 
2-mercaptoethanol buffer. Column fractions were 
monitored for absorbance at 280 run and for protein 
content by SDS-PAGE (15% acrylamide). The fractions 

30 were also analyzed by Western blots. Figure 30, 

panel (a) is a chroma togram displaying the purifi- 
cation of rsT4. 113.1 by ion-exchange chromatography. 
In that figure, peaks containing rsT4. 113.1 are 
identified. The rsT4. 113.1 was found to elute early 

35 in the NaCl gradient and to be well-resolved from 
low-molecular weight contaminants . 
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In order to separate rsT4. 113.1 from high- 



molecular weight contaminants, we carried out gel- 
filtration chromatography on an rsT4. 113 .1 -containing 
pool for final purification of the protein to near 
5 homogeneity (>95% purity). More specifically, we 

prepared a pool containing 20 mg of protein in 50 ml 
and then concentrated to 10 ml in a stirred-cell 
ultrafiltration unit (Amicon, Danvers, MA. ) using a 
PM-30 membrane (Amicon). Subsequently, 5.0 ml of 

10 the concentrate was applied to a 1.5 x 95 cm S-300 
column (Sigma) equilibrated and developed in the 
same Tris/urea/2-mercaptoethanol buffer. We moni- 
tored the column fractions for absorb ance at 280 nm 
and for protein content by SDS-PAGE. The fractions 

15 were also analyzed by Western blots. A pool con- 
taining rsT4. 113.1 (approximately 4 mg) in 15 ml was 
thus prepared. Figure 30, panel (b) is a chroma to- 
gram displaying the purification of rsT4. 113.1 by 
gel -filtration separation of the rsT4. 113.1 pool. 

20 In that figure, peaks containing rsT4. 113.1 are 
identified. 



depicting the purification of the rsT4 derivative 
throughout the centrifugation and chromatography 
25 steps. In Figure 30, panel (c), the lanes depicted 



Figure 30, panel (c) is an SDS-PAGE analysis 



are 



30 



35 



lane A: 



lane B: 



lane C: 



lane D: 



lane E: 



molecular weight standards 
cell extracts 

cell pellet following solubilization 
of cell extract in non-denaturing 
conditions 

supernatant following solubilization 
of cell extract in non- denaturing 
buffer 

supernatant following ultracentri- 
fugation step 
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lane F: Q-Sepharose pool 

lane G: S-300 gel- filtration pool. 

Refolding Of Purified rsT4. 113.1 

We refolded the purified rsT4. 113.1 by 
5 dilution and dialysis steps to non-denaturing and 
oxidized conditions. More specifically, refolding 
of the protein at a concentration of 0.5 OD (280)/ml 
wa s achieved by stepwise dialysis against 500 volumes 
of 3 M urea, 20 mW Tris (pB 7.5); 500 volumes of 1 W 

10 urea, 0.1 M ammonium acetate (pH 6.8) and, finally, 
the same volume of a phosphate-buffered saline solu- 
tion. Throughout the refolding procedure, samples 
of the protein were monitored for relative content 
by spectral analysis and by high-performance liquid 

15 chromatography ( *HPLC" ) performed on a 150A liquid 
chromatographic system (Applied Biosys terns. Inc., 
Foster City, California). An octasilyl column 
(Aquapore RP-300, 0.46 x 3.0 cm) was equilibrated in 
80% 0.1% trifluoroacetic acid ("TFA" ) /water (sol- 

20 vent A) and 20% 0.085% TF A/70% acetonitrile (sol- 
vent B) and developed with a linear gradient of 
increasing acetonitrile concentration from 20% to 
80% (solvent B) over 45 min at a flow rate of 
0.5 ml /min. 

25 As shown in Figure 31, panel (a), protein 

in 7 M urea* 10 mM 2-mercaptoethanol and 20 mM 
Tris(pH 7.5) eluted from the HPLC column at 49% 
acetonitrile in the gradient. In subsequent steps, 
. from 1 M urea/1 mM ammonium acetate (pH 6.8) [Fig- 

30 - ure 31, panel (b)] to phosphate buffered saline 

[Figure 31 , panel (c)], an increasing percentage of 
rsT4. 113.1 was found to elute earlier in the HPLC 
gradient — at 47% acetonitrile. The identity of 
the earlier eluting peak as oxidized product was 

35 verified by reduction of rsT4. 113.1 in non-chaotropic 
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solutions and application of sample thus treated to 
HPLC under the sane conditions. 

The elution of oxidized rsT4. 113.1 prior to 
reduced protein on HPLC suggests that formation of 
5 the single disulfide bridge decreases relative hydro- 
phobicity of the protein (J. L. Browing et al-. Anal. 
Biochem. , 155, pp. 123-28 (1986)]. Spectral analysis 
of rsT4 . 113 . 1 was performed throughout the course of 
refolding in order to monitor relative yield of solu- 
10 ble protein in the procedure. The refolding method 
allowed approximately 20% recovery of rsT4. 113.1. 
HPLC analysis indicated a less than 15% contaminant 
of reduced protein in the preparation (Figure 30 , 
panel (c), lane G). 

15 sequencing Of Renatured rsT4.113 

we then carried out amino acid analysis of 
rsT4 . 113 • 1 by automated Edman degradation in an 
Applied Biosystems 470A gas phase seguenator equipped 
with a 900 A data system. Phenyl thiohydanti on amino 

20 acids generated during the course of the degradative 
chemistry were analyzed on-line using an Applied 
Biosystems 120A FTH-analyzer equipped with a PTH-C18 
2.1 x 220 mm column. Protein (10 pg) for sequence 
analysis was applied to SDS-PAGE (15% acrylamide) 

25 and electroblotted on an Immobilon membrane (Millipore 
Corp., Bedford. Massachusetts) as described by 
P. Matsudaira, J. Biol. Chem. , 262, pp. 10035-38 
(1987). 

Amino acid analysis of protein samples was 
30 performed by hydrolysis of protein in 6 N HC1, in 
vacuo, for 24 h at 110°C. The hydrolysates were 
then applied to a Beckznan 6300 Analyzer equipped 
with post-column detection by ninhydrin. Western 
blot analysis of the SDS-PAGE gels was carried out 
35 by standard techniques using rabbit antisera JB-1. 
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Sequence analysis repealed an amino terminal sequence 
of: Met-Gln-Gly-Asn-Lys-Val-Val 

The purified rsT4. 113.1 protein was found 
to contain stoichiometric quantities of the amino- 
5 terminal methionine placed in the protein construct 
for expression in E.coli and an intact polypeptide 
c hain consistent with a sequence derived from the 
plasmid construction. Recovery of phenyl thiohydan- 
toinyl -methionine at the first cycle of the degrada- 
10 tive chemistry was 60% consistent with routine initial 
yields obtained in the automated Edman. This obser- 
vation excludes the possibiity that a significant 
percentage of the rsT4. 113.1 lacked the initiation 
methionine, i.e.. the HHj -methionine was not removed 
15 by expression of rsT4. 113.1 in E.coli . or that sequence 
analysis was impaired by the presence of glutamine 
at the first cycle of the degradative chemistry. 
Sequence analysis was performed for 40 cycles and no 
evidence of lysine carbamylation was observed. Amino 
20 acid analysis displayed a close correlation of actual 

and theoretical values for amino acids, thus indicating 
the marked absence of proteolytic degradation in the 
course of expression, or purification, or both. 

Immunoprecipitation Of CHO Cell 
25 r.ines Producing Soluble T4 

* ' 35 

We te sted the conditioned media from S-Cys 

metabolically labelled CHO cells trans fected with 

one of the T4 mutant constructs pBG377, pBG380. pBG381, 

the full length recombinant T4 construct pBG379, of 

30 this invention or vector only, to determine whether 
- any produced a molecule recognized by the anti-T4 
monoclonal antibody 19 Thy. To carry out this test, 
we incubated about 10 7 CHO cells transfected with 
either pBG380, pBG381. pBG377. pBG379 or pBG312. for 

35 5 hours at 37°C with 180 M Ci/ml 35 S-labelled cysteine 
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[DuPont, New England Nuclear] in 4 ml RPMI cys~ medium 
(Gibco). After labelling of the cells, 1 ml of fil- 
tered, conditioned media was made 0.5 mM with phenyl - 
methyl-sulphonyl fluoride and immunoprecipitated 
5 with OKT4 and protein A Sepharose (P. H. Sayre and 
E. L. Reinherz, Eur. J. Immunol . , 15, pp. 291-95 
(1985)]. Subsequently , we incubated media from the 
35 S-labelled cells with OKT4 (ATCC #CRL 8002). We 
r h^n immuno-precipitated with protein A Sepharose 

10 and subjected the immuno-precipitates to SDS-PAGE 

un der reducing conditions on 10% polyacryl amide gels 
[U. K. Laemmli, Nature , 227, pp. 680-85 (1980)). 
Autoradiography was carried out with X-Omat X-ray 
film (Eastman Kodak). 

15 As shown in lanes 3-5 of Figure 32, both 

pBG380 (rsT4.2) and pBG381 (rsT4.3) directed the 
synthesis of a secreted, immune, 35 S-labelled T4 
protein that was recognized by the OKT4 anti-T4 
antibody. The immunoprecipitated truncated mole- 

20 cules migrated as 49 Kd proteins, a result consis- 
tent with their predicted molecular weights. In 
contrast, no soluble T4 antigen could be detected in 
the conditioned media of cell lines stably trans - 
fected with pBG377 (rsT4.1) or pBG379 (rflT4). 

25 immunoprecipitation analysis of cellular extracts of 
cell lines trans fected with pBG377 suggests that the 
rsT4.1 gene may be misfolded, which could account 
for a block in its secretion (M. J. Gething et al.. 
Cell , 46, pp. 939-50 (1986)). 

30 In Figure 32, the lanes represent the 

following: Lane 1 ; immunoprecipitation from condi- 
tioned medium of CEO cells stably co- trans fected 
with vectors pBG312 and pAdD26 . Lane 2 ; blank. 
Lanes 3 and 4 : immunoprecipitation from conditioned 

35 medium of CHO cells stably co-trans fected with pBG380 
(rsT4.2) and pAdD26. Lanes 5 and 6 : immunoprecipita- 

|71l? tion from conditioned medium of CHO cells stably 
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co-transfected with pBG381 (rsT4.3) and pAdD26. 
Lane 7 : immunoprecipitation from conditioned medium 
of CHO cells stably co- trans fected with recombinant 
full length T4 (pBG379) and pAdD26. In Figure 32, 
5 the arrow indicates the predicted position of the 
soluble T4 from pBG380 or pBG381 relative to the 
migration of standard molecular weight markers. 

I znxnunop r ecip i ta ti on Of COS 7 Cell Lines 
Producing Recombinant Soluble T4 

10 We expressed recombinant soluble T4 

derivatives pBG392, pBG393 and pBG394 in COS 7 cells 
by el ectroporation , essentially as described by 
G. Chu et al. f "Electroporation For The Efficient 
Trans fecti on Of Mammalian Cells with DNA* , Nuc. 

15 Acids Res, , 15, pp. 1311-26 (1987) . More specifi- 
cally, we introduced 20 pg closed circular plasmid 

DNA and 380 pg of carrier (sonicated salmon sperm 

7 

DNA) into 3 x 10 COS 7 cells. The cells were 
electroporated using a Gene Pulser (Biorad) set at 

20 300 volts. Subsequently, we incubated the COS 7 

cells in Dulbecco's Modified Eagle's Medium supple* 
men ted with 10% fetal calf serum for 24 hours. We 
*-H«»n harvested the conditioned media, filtered it' 
thr ough a Millipore Minidisk 0.22 p hydrophilic 

25 filter cartridge (Millipore #MCCL 305-01) and 

concentrated the secreted proteins on a fast-S ion 
exchange column (S-Sepharose Fast Flow, Pharmacia 
#17-0511-01) in 20 mM MES buffer (pH 5.5). 

We then eluted the bound proteins with 

30 20 mM Tris-HCl (pH 7.7) and 0.3 M NaCl. The elution 
pool was subsequently diluted with 2 volumes of 20 mM 
Tris-HCl (pH 7.7) and it was then loaded on a column 
comprising either 19Thy anti-T4 monoclonal antibody 
and protein A Sepharose or OKT4A and protein A 

35 Sepharose. We washed the column extensively and 

eluted the bound material as 0.5 ml fractions with 
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50 mM glycine-HCl (pH 2,5), 150 mM NaCl, 0.1 mM EGTA 
and 5 pg/»l Bovine pancreatic trypsin inhibitor, 
Aprotinin (Sigma, #A1153). The immunoprecipitates 
were subjected to SDS PACE (10% gel) followed by 
5 inununoblotting against rabbit antisera raised 

against peptide JB-1. We employed silver stained 
gels to follow binding and elution of rsT4 during 
chromatography • 

Figure 33 depicts an jjnmunoblot analysis of 

10 transiently expressed pBG392 (rsT4.7) [lanes 10, 
11]; pBG393 (rsT4.8) [lanes 4, 7, 8] and pBG394 
(rsT4.9) [lane 5]. The standards are 50 ng purified 
rsT4.3 (lane 1); ISO ng purified rsT4.3 (lane 2) and 
250 ng purified rsT4.3 (lane 3). The arrow indicates 

15 the expected position of migration of a protein with 
the relative molecular weight of rsT4 . 7 : 21,000 
daltons. The sample that was to be loaded into lane 4 
was lost and lanes 6 and 9 are blank. 

As shown in lanes 10 and 11 of Figure 35, 

20 pBG392 (rsT4.7) directed the synthesis of a secreted, 
immune protein that was recognized by the anti-T4 
antibodies OKT4A and 19Thy. Lanes 4, 7 and 8 also 
demonstrate that pBG393 (rsT4.8) directed the 
syn th esis of a secreted, immune protein that was 

25 recognized by OKT4A and 19Thy. This analysis 

illustrates that rsT4 . 7 contains the OKT4A epitope . 
It also suggests that the binding region for HIV 
envelope binding resides in the amino 182 terminal 
residues of T4- 

30 In contrast, no soluble T4 could be detected 

in the media of cell lines transfected with pBG394 
~(rsT4.9) [see lane 5). Immunoprecipitation analysis 
of cellular extracts of cell lines transfected with 
pBG397, however, showed that rsT4.9 was recognized 

35 by OKT4A. We believe that rsT4.9, a 113 amino acid 

construct, binds the HIV virus and that it represents 
a second generation soluble T4, one with only two 
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cysteines and one of three disulfide bridges. 
Accordingly, rsT4.9 is easily produced in E.coli or 
yeast systems* 

Similarly, although no soluble T4 could be 
5 detected in the media of cell lines trans fected with 
pBG396 (rsT4.12), analysis of cellular extracts of 
those cell lines showed that rsT4.12 was recognized 
by OKT4A. Thus, rsT4 . 12 may also bind HIV virus. 

Radioimmunoassay And Epitope 
10 Analysis Of rsT4.113 

In order to determine if the 113 fragment 
of rsT4 contained structural determinants for binding 
to OKT4A, Leu-3A and OKT4, we then carried out 
radioimmunoassay and epitope analysis of rsT4.113 

15 using a competitive inhibition radioimmunoassay 

[C. J. Newby et "Solid-Phase Radioimmune Assays" 

in Handbook Of Experimental Immunology , D. M. Weir 
(Ed.). 1# pp. 34.1-34.8 (1986)). As OKT4A and Leu-3A 
block infectivity of HIV in vitro f Dalgleish et al . , 

20 supra] and binding of T4 to gpl20/160 [ McDougal et al. , 
supra), this analysis served as a first approximation 
as to whether or not rsT4.113 contained structural 
elements for interaction with HIV. 

We first coated U-bottom 96 well microti ter 

25 plates (Falcon) with 50 p I/well goat-anti -mouse IgG 
(Hyclone Typing Kit, Logan, Utah) in PBS (pH 7.0) to 
a concentration of 50 vq/ml and incubated the plates 
overnight at 4°C. We then rinsed the plates with 
IX PBS and blotted them dry. The plates were t h e n 

30 - blocked by the addition of 100 pl/well of a IX PBS 
solution containing 5% bovine serum albumin for 
1 hour at room temperature . We rinsed the plates 
with PBS, blotted dry and then spotted them with 
50 pi of one of three antibody solutions containing 

35 either OKT4 (10 pg/ml in block buffer); OKT4A 
(500 ng/ml in block buffer) or Leu-3A (Becton- 
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Dickinson) (500 ng/ml in block buffer). We let the 
plates stand for 2 hours at room temperature . We 
then washed the plates 3 times with a PBS/0.05% 
Tween-80 solution and 2 times with IX PBS and blotted 
5 them dry. 

In a separate plate, we titrated competitor 
samples of unlabeled rsT4.313.1 from 20 \*q/ml and 
serially diluted twice (including no competitor con- 
trol), with final volumes in each well of 25 pi. 

10 The positive control for this assay was competition 
with unlabeled rsT4.3 (375 amino acids). We then 
added 25 pi of A4 "l-rsT4.3 containing 10,000 
cpm/25 pi (prepared according to A. E. Bolton and 
W. M. Hunter, Radioimmunoassay And Related Methods , 

15 Chapter 2c). Subsequently, we spotted the entire 

50 m! content of each well onto the assay plate con- 
taining each of the three antibody solutions and 
incubated for 2 h at room temperature. We then 
washed the plates 3 times with a PBS/0.5% Tween-80 

20 solution and 2 times with IX PBS, blotted them dry 

and then counted the wells in a Beckman gamma counter 
for radioactivity. 

As shown in Figure 34, rsT4. 113.1 competes 
with 125 I-rsT4.3 for absorption to an OKT4A solid 

25 phase in a dose-dependent manner . Additionally, 

125 

rsT4. 113.1 competes with I-rsT4.3 for absorption 
to a Leu-3A solid phase in a dose-dependent manner. 
By comparison to unlabeled rsT4.3, rsT4. 113.1 exhibits 
a molar affinity for those antibodies within a factor 
30 of 3. In the 0.4 to 25 pg/ml concentration range 

tested, rsT4.113 did not compete with radiolabelled 
rsT4.3 for binding to OKT4. In a similar assay, we 

125 

observed that rsT4.111 also competes with I-rsT4.3 
for binding to OKT4A and Leu-3A, but not to OKT4 
35 [Figures 35-37] . 

Based on these results, we believe that 
the epitopes for OKT4A and Leu-3A are contained within 
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the amino- terminal 11^ amino acids of T4. We also 
believe that the epitope for OKT4 binding is localized 
within the carboxy terminal of the T4 polypeptide. 

Accordingly, we believe that the gp!20- 
5 binding domain is localized within the amino terminal 
113 or 111 amino acids of the T4 protein- Based on 
this belief, we synthesized various synthetic oligo- 
peptides which contain sequence within that structural 
domain. These oligopeptides are represented in 
10 Figure 3 as follows: 

Oligopeptide Amino Acid Coordinates 

JB-1 44-63 
rsT4 #6 18-29 
rsT4 #7 5-56 
15 rsT4 #8 84-97 

rsT4 #9 30-63 
We synthesized these peptides using conventional 
phosphoamide DNA synthesis techniques [Tetrahedron 
Letters , 22, pp. 1859-62 (1981)]. We synthesized 
20 the peptides on an Applied Biosystems 380A DKA 

Synthesizer and- purified them by gel electrophoresis. 

ELISA Assay For rsT4.113 

we also carried out an ELISA assay for 
rsT4. 113.1 produced by p211-ll-trans formed E.coli . 
25 Throughout this assay, dilutions were made in block- 
ing solution and, between each step, we washed the 
plates with PBS/0. OS* Tweeh-20. More specifically, 
we coated wells of Immulon 2 (Dynatech, Chantilly, 
Virginia) plates with .005 OD (280 nm)/ml of OKT4 

30 (IgG2b) in 0.05 M bicarbonate buffer to a volume of 
50 pl/well and incubated the plates overnight at 
4°C. We then blocked the plates with 5% bovine 
serum albumin in PBS, 200 M l/well, and incubated for 
30 minutes at room temperature. 

35 subsequently, we added 50 pi of 50 ng/ml 

rsT4.3 to each well, incubating overnight at 4°C. 
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We then added 50 of a mixture containing 

rsT4.113.1 and 10 ng/ml of OKT4A and incubated for 
2 1/2 hours at room temperature. Using a Byclone 
Kit (Hyclone), we then carried out the following 
S steps. First, we added 1 drop of rabbit anti-mouse 
IgG2a to each well and incubated the plates for 
1 hour at room temperature. We then added 100 pi of 
peroxidase-labeled anti-rabbit IgG, diluted 1:4000 
with blocking buffer to each well, and incubated for 

10 1 hour at room temperature. 

We prepared a substrate reagent as follows. 
We diluted substrate reagent 1:10 in distilled water 
and added two O-phenyl -ethylene-diamine ( "OPD" ) 
chromophore tablets per 10 ml of substrate. We let 

15 the mixture dissolve thoroughly by mixing with a 

vortex. Alternatively, a TMB peroxidase substrate 
system (Kirkegaard & Perry Catalogue #50-76-00) may 
be used. Subsequently, we added 100 \xl of the 
chromophore solution to each well, incubated for 

20 10-15 minutes at room temperature and then stopped 
the color development with 100 pi of IN B^SQ^. We 
then measured OD at 490 nm, using an EXISA plate 
reader . 

The results of the assay are demonstrated 
25 in Figure 38. 

We then subjected the soluble T4 proteins 
produced by the T4 constructs of this invention to 
various functional assays. 

Assays Of The Antiviral Activity Of Soluble T4 

30 The antiviral activity of soluble T4 

according to this invention was evaluated using 
modifications of various in vitro systems used to 
study antiviral agents and neutralizing antibodies 
[D. D. Ho et al., "Recombinant Human Interferon Alpha 

35 (A) Suppresses HTLV-III Replication In Vitro ", 
37 23 Lancet , pp. 602-04 (1985); K. Hartshorn et al . , 
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"Synergistic Inhibition Of HTLV-xII Replication 
In Vitro By Phospbono formate And Recombinant Inter- 
feron Alpha-A", Antimicrob Ag Chemoth , 30, pp. 189-91 
(1986)]. 

5 For each of these assays, we prepared graded 

concentrations of soluble T4 and preincubated them 
with an H9 derived 1 1 IB isolate of HIV [a gift from 
Drs! M. Popovic and R. Gallo, National Cancer 
Institute, Bethesda, Maryland]. The isolate was 

10 maintained as a chronically infected culture in H9 
cells. Cell-free HIV stocks were obtained from 
supernatant fluids of HTLV-III infected H9 cultures 
( culture conditions : 1 x 10 6 cells/ml with 75% viable 
cells ) . we prepared serial 10 fold dilutions of 

15 recombinant soluble T4 ranging from 10 picograms/ml 
to 10 micrograms/ml and incubated them with fifty 
50% tissue culture infectious doses (TCID 5Q ) of HIV 
fcr 1 hour at 37°C, in RPMI-1640 supplemented with 
20% heat inactivated fetal calf serum (FCS). We 

20 then added 150 pi of H9 cells to a final concentra- 
tion of 0.5 x 10^ cells/ml which were not HIV-infected 
to the wells containing aliguots of the recombinant 
soluble T4/HIV mixture. 

We adjusted each virus inoculum to a con- 

25 centration of 250 TCID S0 /ml. We preincubated 100 jj1 
of the virus inoculum with 200 pi recombinant solu- 
ble T4 or 100 pi immunoglobulin prepared in tripli- 
cate serial 2-fold dilutions for 1 hour at 37 °c 
prior to inoculation onto 1.5 - 2 x 10* H9 cells in 

30 5 ml RPWI 1640 supplemented fetal calf serum (20%), 
HEPES ( lOmM ) , penicillin (250 U/ml ) , streptomycin 
(250 pg/ml) and L-glutamine (2mN). On days 5, 6, 7, 
10 and 14, we examined each culture for characteris- 
ic cytopathic effects ( "CPE" ) - Neutralization was 

35 defined as the inhibition of syncytia formation com- 
ared with controls . 
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The positive control us*d was HIV seroposi- 
tive neutralizing serum, as described in D. D. Ho 
et al., "Human Immunodeficiency Virus Neutralizing 
Antibodies Recognize Several Conserved Domains On 
5 The Envelope Glycoproteins", J . Virol . , 61 , 

pp. 2024-28 (1987). The negative controls used 
were HIV seronegative serum only and buffer only. 

ry+*<pa+Ki<r Effect Assay (CPE) 

In thi s assay , following conventional 
10 protocols for cytopathic effect assays f Klatzmann 

et al. (1984), supra and Wonq-Staal and Callo (1985), 
supra], ve microscopically examined the B9 cells for 
evidence of cytopathic effects of HIV. 

The CPE was scored on a four point scale 
15 from 1+ to 4+, with 4+ representing the highest 
degree of CPE. 

By day 14, wells containing recombinant 
soluble T4 according to this invention (rsT4.2, 
derived from the pBG380 transfected CHO cell line 
20 BG380) at 10 nq/ml showed no evidence of CPE, while 
the negative control showed 1* to 3* CPE. 

p24 Radioimmunoassay 

We then tested soluble T4 as an inhibitor 
of viral replication in an HIV virus replication 

25 assay- according to D. D. Ho et al., J. Virol. , 61, 
pp. 2024-28 (1987) and J. Sodroski et al.. Nature , 
322. pp. 470-74 (1986). We carried out the assay 
essentially as described, except that the cultures 
were propagated in microtiter wells containing 

30 200 pi- I* 1 this assay, we evaluated the ability of 
the soluble T4 polypeptides of this invention to 
block HIV replication, as measured by HIV p24 
antigen production. We sampled supematants twice 
weekly for HIV p24 antigen as described below. 
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Wc obtained an assay kin [HTLV-III p24 
Radioimmunoassay System, Catalogue Nc NEK-040, 
NEK-040A, Biotechnology Systems, New Research 
Products, DupontJ which contains affinity purified 
5 I labelled HIV p24 antigen, a rabbit anti-p24 

antibody and a second goat anti-rabbit antibody which 
is used to precipitate antigen- antibody complexes. 
We carried out the assay according to the protocol 
included with the kit. Accordingly, we mixed a sample 

10 to be assayed or one of a series of amounts of 

unlabelled p24 antigen with a fixed amount of 125 1 
labelled p24 and a fixed limited amount of rabbit 
anti-p24 antibody. We incubated the samples over- 
night at room temperature and then added a goat 

15 anti-rabbit immunoglobulin preparation for 5 minutes 
at 40*C. We centrifuged the samples in a micro fuge 
and aspirated the supernatant fluid. Pelletted 125 1 
labelled p24 was quantitated for each sample by gamma 
counting and a standard curve for the I p24 dis- 

20 placed by the known amounts of antigen added to 

standard tubes was constructed. We then calculated 
125 

the I labelled p24 displaced by the antigen present 
in the unknown samples by interpolation using the 
standard curve constructed from the known amounts of 
25 p24 antigen contained in the standard samples. The 
results are shown in the table below. 
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D24 ASSAY OF F'V REPLICATION INHIBITION 



Day 



10 



15 



20 



25 



30 



10 



14 



rsT4.2 
(uq/ml) 


Patient 
Serum 


Average 
CPM 


% Bound/ 
Unbound 


0.5* 
c n** 


Negative 
Positive 


344 
2.237 

551 
1,766 


8.5 
112.4 
19-9 
86.6 


0.5* 
5.0** 


Negative 
Positive 


230 
2,459 

322 
1,980 


2.2 
124.6 
7.3 
96.3 


0.5* 
5.0** 


Negative 
Positive 


221 
2,284 

246 
1.988 


1.8 
115.0 
3.1 
98.7 



These results demonstrate that soluble T4 
according to this invention at a concentration of 
5 vg/ml completely inhibits virus replication as 
measured in this standard 14 day assay. These 
results are also depicted in Figure 39 in graphic 
form. In Figure 39, values were calculated from a 
s tandar d curve of p24 according to assay kit 
instructions . 



* This concentration was initially believed to be 
1-0 pg/ml, based upon our preliminary approximation 
that 1 unit of absorbance at 280 nm ( "A^oo* ) - was 
equivalent to 1 mg of rsT4.2- AbsorbanCS at 280 nm 
is a commonly used first approximation of protein 
concentration. Upon amino acid analysis of the pro- 
tein, however, we found that it had a higher extinc- 
tion coefficient than originally approximated, with 
1 A-o 0 unit of rsT4.2 being equivalent to 0.5 mg of 
the^protein. 

35 ** This concentration was initially beliaved to 

-be 10 »g/ml, based upon our preliminary approximation 
that 1 uni t of absorbance at 280 nm <" A 280 M *' was 
equivalent to 1 mg of rsT4.2. AbsorbanCS at 280 nm 
is a commonly used first approximation of protein 

40 concentration- Upon amino acid analysis of the pro- 
tein, however, we found that it had a higher extinc- 
tion coefficient than originally approximated, with 
1 A- fl0 unit of rsT4.2 being equivalent to 0.5 mg of 
the protein. 
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We then carried out a j.24 replication assay 
as described above, except that the soluble T4 was 
added to the infected cultures during re feeding at 
days 3, 7 and 10, in order to maintain a constant 
5 rsT4 concentration throughout the infection period. 
The results of this assay are shown in the table 
below. 

INHIBITION OF HIV REPLICATION 
WITH CONSTANT CONCENTRATION OF rsT4 



10 rsT4.2 p24 

(tig/ml) (nq/ml) 

0.008 770 

0.031 970 

0.125 85 

15 0.5 0 

5.0 0 

0 1120 

uninfected 0 



These results demonstrate that when solu- 
20 ble T4 protein according to this invention was main* 
taine d at a constant concentration throughout the 
infection period, as little as 0.125 pg/ml of the 
protein substantially blocked replication of 250 
TCID 50 /ml of HIV-1. 
25 Advantageously, soluble T4 protein accord- 

ing to this invention, at concentrations far exceed- 
ing those required to block viral replication, did 
not exert ixnmuno toxic effects in vitro , as measured 
by thr ee lymphocyte proliferation assays — mixed 
30 lymphocyte response, phytohemagglutinin. and tetanus 
toxoid stimulated response. 

Syncytia Inhibition Assay 

To further assess the effect of soluble T4 
on HIV env-T4 binding, we evaluated the effect of two 
35 preparations of our soluble T4 protein on the syn- 
cytiagenic properties of HIV in the co-cultivation 
assay. We carried out a C8166 cell fusion assay 
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as described in B. D. Walker et p 1 .., Proc, Natl. 
Acad. Sci. USA . 84, pp. 8120-24 <198<). 

We incubated 1 x 10 H9 cells chronically 
infected with HTLV-IIIB for 1 hour at 37°C in 5% 
5 C0 2 with various concentrations of one of two 

preparations of rsT4.2 in 150 pi RPMI-1640 media 
supplemented with 20% fetal calf serum. We then 
added 3 x 10 4 C8166 cells in 50 pi media (a T4* 
transformed human umbilical cord blood lymphocyte 

10 line f Sodroski et al . , supra] , to a final volume of 
0,2 ml in each well. Final well concentrations of 
soluble T4 were 0.5 pg/ml* and 5.0 pg/ml* for prepa- 
ration #1 and 1.25 pg/ml* and 12.5 pg/ml* for prepa- 
ration #2* We then counted total number of syncytia 

15 per well at 2 hours and 4 hours after adding the 

C8166 cells at 37°C in 5% C0 2 . Parallel co-cultiva- 
tions used buffer alone ( negative control ) or OKT4A 
at 25 pg/ml (positive control) as controls. We con- 
sidered a positive result as a 50% reduction in 

20 syncytia compared to controls, at a time when at 

4 

least 100 syncytia per 10 infected H9 cells were 
present in the control cultivations. The results of 
this assay are shown below and in Figure 40 (2 hour 
data). 



25 



* These concentrations were initially believed 
to be, respectively, 1 pg/ml, 10 pg/ml, 2.5 pg/ml 
and 25 pg/ml, based upon our preliminary approxima- 
tion that 1 uni t of absorbance at 280 nm ( m A 28 q m )# 
30 was equivalent to 1 mg of rsT4.2. Upon amino acid 
analysis of the protein, however, we found that it 
had a higher extinction coefficient than originally 
approximated, with 1 A- a0 unit of rsT4.2 being 
equivalent to 0.5 mg ox the protein. 
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INHIBITION IN C8166 FUS '.ON ASSAY 
Preparation frsT4.21 (pg/ml) 



buffer 
rsT4.2 
rsT4.2 
rsT4.2 
rsT4.2 

OKT4A (25 »g/ml) 



0 

0.5** 
5.0** 
1.25** 
12.5** 
0 



% Inhibition * 
2 Hrs 4 Hrs 



0 
30 
54 
16 
77 
100 



0 
42 
47 
21 
55 
100 



10 



15 



20 



As demonstrated in this table and in Fig- 
ure 40, soluble T4 according to this invention at 
5.0 pg/ml and 12*5 pg/ml inhibited syncytia formation 
at 2 hours, as compared to buffer alone. By 4 hours 
after the addition of C8166 cells, soluble T4 at 
12.5 pg/ml continued to inhibit greater than 50% 
syncytia formation, as compared to the negative 
control . 

We also evaluated the effect of two prep- 
arations of our soluble T4 protein rsX4.7 on the 
syncytiagenic properties of HIV in a similar co- 
cultivation assay. The results of this assay are 
shown below. 



* All assays were carried out in triplicate, and 
25 the n umber of syncytia counted per well was averaged 

to calculate % inhibition. The % inhibition repre- 
' sents the difference between the average number of 
syncytia in the negative control (without rsT4 or 
OKT4A) and the average number of syncytia counted 
30 when either rsT4 or OKT4A were present during the 
assay, divided by the average syncytia count for 
the negative control and multiplied by 100. 

** These concentrations were initially believed 
to be, respectively, 1 pg/ml, 10 pg/ml, 2.5 pg/ml 

35 and 25 pg/ml, based upon our preliminary approxima- 
tion that 1 unit of absorbance at 280 nm ( "A-*' ) , 
was equivalent to 1 mg of rsT4.2. Upon amino acid 
analysis of the protein however, we found that it 
had a higher extinction coefficient than originally 

40 approximated, with 1 A-^q unit of rsT4.2 being 
equivalent to 0.5 mg oi^the protein. 
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INHIBITION IN C8166 FUSION ASSAY 
Average 

rsT4.7 Syncytia/SOpl % Inhibition 
5 Preparation (ug/ml) aliquot: at 2 Hrs 

H9 cells 0 0 N/A 

( control ) 

C8166 cells 0 0 N/A 

( control ) 

10 HIV-infected 0 118 0 

H9 cells 
added to 
C8166 cells 
( control ) 

15 OKT4A 0 0 100 

( control ) 

Prep* 1 of 

rsT4.7 S 5.0* 43 63.6 



20 * This concentration vas initially believed to 
be 10 pg/ml, based upon our preliminary approxima- 
tion that 1 unit of absorbance at 280 nm ("K-qq" )* 
was equivalent to 1 mg of rsT4.2. Upon amino acid 
analysis of the protein, however, we found that it 

25 had a higher extinction coefficient than originally 
approximated, with 1 A Z80 unit of rsT4.2 being 
equivalent to 0.5 mg ox the protein. 

- 373i 89085519 



WO 89/01940 £ PCT/US88/02940 



-82- 



10 



assay aatfti ia 



15 



Preparation 

H9 cells 
( control ) 

C8166 cells 
( control ) 

HIV-infected 
H9 cells added 
to C8166 cells 
(control) 

OKT4A (control) 

Prep. 2 of 
rsT4.7 



rsT4 . 7 
(pq/ml) 



Average 
Syncytia/SOp 1 
aliquot 



141 



£ 5.0* 



27 



% Inhibition 
at 2 Hrs 

N/A 



N/A 
0 

100 
80.9 



* This concentration was initially believed to 
be 10 pg/ml, based upon our preliminary approxima- 
tion that 1 unit of absorbance at 280 nm <" a 2 q 0 m )* 
20 was equivalent to 1 mg of rsT4.2. Upon amino acid 
analysis of the protein, however, we found that it 
had a higher extinction coefficient than originally 
approximated, with 1 A„ Q unit of rsT4.2 being 
equivalent to 0.5 mg ox^the protein. 
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ftssaY flats? flav n 

Average 

rsT4.7 Syncytia/SOpl % Inhibition 
Preparation (pg/ml ) aliquot: at 2 Hrs 

5 H9 cells 0 0 N/A 

(control) 

C8166 cells 0 0 N/A 

( control ) 

HIV-infected 0 128 0 

10 E9 cells added 
C8166 cells 
( control ) 

OKT4A (control) 0 0 100 

Prep. 1 of 

15 rsT4.7 S 5.0* 35 72.7 

Prep. 2 of 

rsT4.7 S S.O* 2 98.4 

As demonstrated in these tables , soluble 
T4 protein rsT4.7 inhibited syncytia formation in 

20 HIV-infected H9 cells. 

We also evaluated the effect of rsT4. 113.1 
and rsT4.111 on the syncytia genie properties of HIV in 
a co-cultivation assay. We carried out a C8166 cell 
fusion assay as described in Walker et al . , supra. 

25 We incubated 1 x 10 4 H9 cells chronically 

infected with HTLV-IIIB for 1 hour at 37°C in 5% 
C0 2 , with from 5 to 50 pg/ml rsT4. 113.1 or rsT4.111 
in 150 pi RPHI-1640 media supplemented with 20% 
fetal calf serum in 96-well microti ter plates. We 



30 



*- This concentration was initially believed to 
be 10 Mg/ml, based upon our preliminary approxima- 
tion that 1 unit of absorbance at 280 run ( w A 280 tt ), 
was equivalent to 1 mg of rsT4.2. Upon amino acid 
35 analysis of the protein, however, we found that it 
had a higher extinction coefficient than originally 
approximated, with 1 A-q Q unit of rsT4.2 being 
equivalent to 0.5 mg ox the protein. 
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then added 3 x 10 4 C8166 cells to the wells in 50 pi 
aliguots . The plates were incubated for 2 hours at 
37 P C in 5% C0 2 and, following this incubation, the 
number of syncytia per well were counted. 
5 Syncytia were defined as cells containing 

a ballooning cytoplasm greater than three cell 
diameters. All samples were counted twice. Parallel 
co-cultivation used OKT4A alone or rsT4 . 3 alone at a 
concentration of 25 pg/ml (positive controls) or H9 
10 cells alone or C8166 cells alone (negative controls). 
The results of this assay are shown below and in 
Figure 41. 



INHIBITION IN 


C8166 FUSION 


ASSAY 


Preparation 


rsT4(uq/ml) 


X Inhibition 


B9 cells (control) 


0 


0 


C8166 cells (control) 


O 


0 


rsT4 . 113 . 1 


1.25 


35 


rsT4. 113.1 


2.5 


63 


rsT4. 113.1 


4.25 


63 


rsT4. 113.1 


6.25 


82 


rsT4. 113.1 


12.5 


96 


rsT4.3 


12.5 


100 


OKT4A (25 pg/ml) 


0 


100 



As demonstrated in this table and in 
25 Figure 41, rsT4. 113.1 exhibited a dose-dependent 

inhibition of HIV-induced syncytia formation. The 
" molar specific inhibitory activity of rsT4. 113.1 
appeared to be reduced by an order of magnitude by 
comparison to anti-viral activity of longer forms of 
30 recombinant soluble T4. Thus, whereas rsT4. 113.1 is 
effective toward neutralization of HIV-dependent 
cell fusion in vitro , its molar specific inhibitory 
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activity is decreased by a factor of 10. It is 
undetermined whether this decreased potency is due 
to incomplete renaturation of the EiCOli-derived 
protein, the presence of three additional amino 
acids at the N- terminus of rsT4. 113.1 (Met-Gln-Gly ) 
lacking in rsT4.2 or rsT4 3 produced in mammalian 
cells, or the absence of additional structure in 
rsT4. 113.1 required for high-affinity binding to 
HIV. 

we also carried out a C8166 cell fusion 
assay with rsT4.111, as described for rsT4. 113.1. 
Tne results of this assay are shown below. 

INHIBITION IN C8166 FUSION ASSAY 



rsT4(uo/ml) 


X inhibition 


0 


0 


0 


0 


1.25 


0 


2.5 


40 


4.25 


20 


6.25 


67 


12-5 


100 


25.0 


100 


12.5 


100 


25.0 


100 


0 


100 



Preparation 

15 H9 cell (control) 

C8166 cells (control) 

rsT4.111 

rsT4.111 

rsT4 . Ill 
20 rsT4 . Ill 

rsT4.111 

rsT4 . Ill 

rsT4.3 

rsT4.3 
25 OKT4A (25 pg/ml) 

As demonstrated in this table, rsT4.111 
exhib ited a dose-dependent inhibition of HIV- induced 
syncytia formation. At a concentration of 12.5 pg/ml 
and 25.0 jig/"*' complete inhibition of cell fusion 
30 was achieved. 

Kinetics Of Intramuscular Injectio n Of Soluble T4 

We examined the kinetics of the appearance 
of a recombinant soluble T4 protein according to 
this invention (specifically. rsT4.3 from the pBC381- 
35 transfected cell line BG381) in serum after intra- 
muscular injection as follows. 
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We obtained two cynomolgus monkeys ( Macaca 
fascicularis ) who were free of infectious disease 
and in good health. Each monkey had been subjected 
to a 6 week quarantine period prior to administration 
5 of the soluble T4 protein. Throughout the adminis- 
tration period, each monkey was maintained on a con- 
ventional diet of monkey chow supplemented with fresh 
fruit. A catheter and a vascular access port were 
surgically placed in a femoral vein of each animal 

10 prior to treatment in order to facilitate blood 
collection. 

Over a period of 28 days, each animal 
received recombinant soluble T4 protein twice daily 
by intramuscular injection to the large muscles of 

15 the thighs or buttocks. Injections were administered 
to each animal 8 hours apart and each injection con- 
tained a volume of 0.15 ml/kg (0.25 mg/kg) of rsT4.3 
(from the pBG381- trans formed cell line BG381 ) , for a 
total dose of 0.5 mg/kg/day /monkey . Serum samples 

20 for clearance determination were collected on day 1 
before the first treatment and at 1, 2, 4 and 
8 hours after the first injection, as well as 1, 2, 
4, 14 and 16 hours after the second injection on 
days 7, 14 and 28. 

25 We found that intramuscularly injected 

soluble T4 reached the maximum level in serum between 

1 and 2 hours after injection, with the level falling 
off slowly and reaching half-maximum value at approxi- 
mately 6 hours post-injection. According to data 

30 obtained for intravenous administration (not shown), 
the level of rsT4.3 in serum should drop below that 
attained via intramuscular injection aproximately 

2 hours after intravenous injection. Thus, while 
the maximum rsT4.3 level in serum after intramuscular 

35 injection does not reach that attainable via intra- 
venous injection, it is slowly released into the 
blood stream, remaining detectable in serum for a 
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much longer time. This slow rel.-ase mechanism asso- 
ciated with intramuscular routes of injection is 
advantageous because a higher level of soluble T4 
protein is available over a longer period of time 
5 over a given concentration; thus remaining in a sus- 
tained level. intramuscular administration of solu- 
ble T4 protein is particularly useful in treating 
early stage HIV-infected patients, to prevent the 
virus from disseminating, or in treating patients 
10 who have been exposed to the virus and who are not 
yet seropositive. 

we determined serum levels of rsT4-3 using 
an ELISA assay- Throughout this assay, dilutions 
were made in blocking solution and, between each 
15 step, we washed the plates with PBS/0.05% Tween-20. 
More specifically, we coated wells of Immulon 2 
plates with .01 OD (280 nm)/ml of OKT4 (IgG2b) in 
0.05 M bicarbonate buffer to a volume of 50 pl/wll 
and incubated the plates overnight at 4°C. We then 
20 blocked the plates with 5% bovine serum albumin in 
PBS, 200 pl/wll, and incubated for 30 minutes at 
room temperature. 

Subsequently, we added 50 pi of sample or 
standard to each well, incubating for 4 hours at 
25 room temperature - We then added 50 pl/well of OKT4A 
at 0.1 Mg/ml and incubated overnight at 4°C- Using 
a Hyclone Kit (Hyclone) we then carried out the fol- 
lowing steps. First, we added 1 drop of rabbit 
anti-mouse IgG2a to each well and incubated the 
30 plates for 1 hour at room temperature. We then added 
.100 pi of peroxidase-labeled anti-rabbit IgG, diluted 
1:4000 with 5% BSA/PBS to each well, and incubated 
for 1 hour at room temperature . 

We prepared a substrate reagent as follows. 
35 We diluted substrate reagent 1:10 in distilled water 
and added two O-phenyl -ethylene-diamine ( "OPD" ) 
chromophore tablets per 10 ml of substrate. We let 
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the mixture dissolve thorough!; by mixing with a 
vortex. Alternatively, a TMB peroxidase substrate 
system (Kirkegaard & Perry Catalogue #50-76-00) may 
be used. Subsequently, we added 100 pi of the 
5 chromophore solution to each well, incubated for 
10-15 minutes at room temperature and then stopped 
the color development with 100 m1 of IN H^C^. We 
then measured OD at 490 nm, using an EL.ISA plate 
reader. 

10 The results of the assay are demonstrated 

in the tables below. 
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rsT4 Level 
( nq/ml ) 



** 





Time ( hr ) 


Day 1 


Day 7 


Day 14 


Day 28 


5 


0 


22.7* 


96.5 


158.0 


19.8 




1 


278.8 


199.6 


360.7 


238 .3 




2 


281.8 


366.8 


306.4 


441.1 




4 


214.9 


246.6 


363.9 


393.2 




5 








290.4 




8 


72.3 


105.0 


199.4 






9** 


246.2 










10 


259.6 










12 


136.0 










22 


23.8 








15 


24 


13.4 










Monkey #7-92 


rsT4 Level 






















<n<?/ml) 








Time(hr) 


Day 1 


Day 7 


Day 


Day 28 


20 


0 


6.7* 


56.0 


106.3 


60.9 


1 


87.2 


225.8 


178.0 


437.7 




2 


254.2 


377.9 
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- background 

- second injection administered after the 
collection of the 8 hour sample. 



Polyvalent Forms Of Recombinant Soluble T4 

35 Receptors may be characterized by their 

affinity for specific ligands, such that, at equili- 
brium, the intrinsic affinity (K ) between monovalent 

a 

receptor and monovalent ligand can be defined as 
[RL]/[R f ] [L f ] , where [RL] is the concentration of 
40 receptor (R) bound to ligand (L) and [R f ] and [L f ] 
are the concentrations of free receptor and ligand, 
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respectively {P. A . Unc^rwood, in Advances In Virus 
Research , ed. K. Maramorosch et al., 34, pp. 283-309 
(1988)] . 

For a polyvalent receptor (with a valency 
5 of n) binding to a polyvalent ligand (with a valency 
of m), a functional affinity can be defined as 
n[R b ]/n[R f ]m[L f ], where [1^] is the concentration of 
bound receptor sites, and n[R f ] and s[L f ] are, respec-. 
tively, the concentrations of free receptor and 
10 ligand binding sites. The effect of increasing the 
valence (the number of binding sites) is to enhance 
the stability of ligand-receptor complexes. The 
affinity of a polyvalent receptor for a polyvalent 
ligand will depend on three factors: the intrinsic 
15 association constant of each binding site, the 

valency (number of binding sites) and the topico- 
logical relationship between the receptor and ligand 
binding sites. Under some circumstances, polyvalent 
binding interactions will lead to higher functional 
20 affinity. The decreased dissociation rate of poly- 
valent ligands with polyvalent receptors results in 
an increased functional affinity [C. L. Hornick and 
F. Karush, Immunochemistry , 9, pp. 325-40 (1972); 
I. Otterness and F. Karush, "Principles Of Antibody 
25 Reactions 11 , in Antibody As A Tool , ed. J. J. 

Marchalonais and G.W. Warr, pp. 97-137 (1982)]. 

The simplest case for receptor polyvalency 
increasing functional affinity is represented by a 
bivalent soluble receptor, such as an antibody 
30 molecule, which has two identical ligand binding 

sites, each capable of independently binding antigen 
with equal affinity. If the antigen is displayed 
polyvalently, for example, chemically coupled to a 
solid support such that the spacing between antigenic 
35 sites can be bridged by the antibody's two antigen 

binding arms, the functional affinity of the antibody 
for the antigen coupled to the solid support would be 
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greater than the intrinsic affinity cf the antibody 
binding site for the monovalent antigen [D. Crothers 
and H. Metzger, Immunochemistry , 9, pp. 341-57 
( 1972 ) J . Because virus particles represent poly- 
5 valent antigens, the greater functional affinity of 
antibodies for polyvalent antigens is an important 
factor for antibody-directed virus neutralization. 

The association of recombinant soluble T4 
and the HIV major envelope glycoprotein gpl20 is an 

10 example of monovalent receptor binding to monovalent 
ligand. The affinity of this interaction has been 
measured, and the association between T4 and gp!20 
has a dissociation constant K d = 4 x 10 M [L. La sky 
et al., Cell , SO, pp. 975-88 (1987)]. 

15 Using the antibody analogy, we believe 

that polyvalent rsT4 will demonstrate a greater 
affinity for HIV-infected cells displaying gpl20 
than monovalent rsT4 and the topicological relation- 
ship between gpl20 on the virus particle or the 

20 infected cell surface, will determine the degree to 
which polyvalent rsT4 exhibits higher functional 
affinity than monovalent rsT4. One example of a 
polyvalent rsT4 is described below, with respect to 
the production of a recombinant bivalent rsT4 con- 

25 sisting of two tandem repeats of amino acids 3-178, 

followed by the C- terminal 199 amino acids of rsT4.3. 
According to this invention, a "polyvalent" receptor 
possesses two or more binding sites for a given 
ligand. Furthermore, the intrinsic affinity of each 

30 ligand binding site of a given polyvalent receptor 
need not be identical. 

As shown in Figure 42, to construct bivalent 
rsT4, we digested pBG391 with Nhe l , which cleaves 
after the valine at position 178 in rsT4, and removed 

35 the Nhe l 5' overhang with mung bean nuclease. Next, 
we cleaved with Bql l I to remove the C-terminal half 
of the rsT4 coding sequence in pBG391. Finally, we 
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ligated a Dral-Bglll fragment contemning the coding 
sequence for rsT4 amino acids 3 (lysine) through 377 
(isoleucine) to the cleaved pBG391 to create pBiv.l, 
a plasmid coding for a fusion protein with a tandem 
5 duplication of the N- terminal 176 amino acids of 

rsT4, followed by the C-terminal 199 amino acids of 
rsT4.3. The protein produced by this plasmid, 
therefore, contains two adjacent N- terminal gpl20- 
binding or OKT4A-binding domains (defined by amino 
10 acid residues 3 through 111 of rsT4.111), followed 
by one OKT4-binding C-terminal domain (Figure 43). 

pBiv.l was transfected by electroporation 
into COS 7 cells to test expression of the bivalent 
rsT4 protein. Three days later, we tested the con- 
15 ditioned medium of the transfected cells for the 
presence of the rsT4 bivalent protein by immuno- 
precipitation, followed by Western blot analysis of 
the precipitated protein. Both OKT4A and OKT4 were 
used for immuno-precipitation to determine that the 
20 OKT4 epitope and at least one of the OKT4A epitopes 
had folded correctly. Both antibodies precipitated 
a protein of the predicted apparent molecular weight 
(60,000d) from the conditioned medium of the cells. 

Bivalent rsT4 may be purified by immuno- 
25 affinity purification from an OKT4 column and the 

purified protein may then be used to perform quanti- 
tative competition assays with rsT4.3. We believe 
that the bivalent molecule would demonstrate equi- 
valent competition against rsT4.3 for OKT4 binding, 
30 but significantly greater competition against mono- 
valent rsT4 for OKT4A binding. The ability of 
bivalent recombinant soluble T4 to block syncytium 
formation may also be demonstrated in the C8166 
fusion assay- We also believe that bivalent 
35 recombinant soluble T4 would block syncytium 

formation at significantly lower concentrations 
than monovalent rsT4; based upon the higher 
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functional affinity of bivalent recombinant 
soluble T4 for gpl20. 

According to alternate embodiments of this 
invention, other methods for producing polyvalent 
5 rsT4 may be employed. For example, polyvalent rsT4 
may be produced by chemically coupling rsT4 to any 
clinically acceptable carrier molecule, a polymer 
selected from the group consisting of Ficoll, poly- 
ethylene glycol or dextran, using conventional 
10 coupling techniques. Alternatively, rsT4 may be 
chemically coupled to biotin, and the biotin-rsT4 
conjugate then allowed to bind to avidin, resulting 
in tetravalent avi din/bio tin/rsT4 molecules- And 
rsT4 may be covalently coupled to dinitrophenol 
15 (DNP) or trinitrophenol (TNP) and the resulting 

conjugate precipitated with anti-DNP or anti-TNP- 
Igm, to form decameric conjugates with a valency of 
10 for rsT4 binding sites. 

Alternatively, a recombinant chimeric 
20 antibody molecule with rsT4 sequences substituted 
for the variable domains of either or both of the 
immunoglobulin molecule heavy and light chains may 
be produced. Because recombinant soluble T4 
possesses gp!20 binding activity, the construction 
25 of a chimeric antibody having two soluble T4 domains 
and having unmodified constant region domains could 
serve as a mediator of targeted killing of HIV- 
infected cells that express gp!20. 

For example, chimeric rsT4/IgG 1 may be 
30 produced from two chimeric genes — an rsT4 /human 
kappa light chain chimera ( rsT4/C kappa ) and an 
rsT4/human gamma 1 heavy chain chimera 

IrrtVCgaM-i)- Both C kappa and C ganuna-1 ^ions 
have been isolated from human recombinant DNA 

35 libraries, and each has been subcloned into animal 

cell selection vectors containing either the 

bacterial neo resistance or bacterial gpt markers 
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for selection in anima. 1 cell hosts against the 
antibiotic G418 or mycophenolic acid, respectively. 

To construct rsT4/C ganuna . 1 and rsT4/C kappa 
chimeric genes, an rsT4 gene segment, including at 
5 least the secretory signal sequence and the N-terminal 
110 amino acid residues of the mature rsT4 coding 
sequence and including a splice donor or portion 
thereof, is placed upstream of the gamma-1 and kappa 
constant domain exons. A suitable restriction 
10 enzyme may be used to cut within the intron down- 
stream of the desired rsT4 coding sequence, thus 
providing a donor splice site. Subsequently, a 
suitable restriction enzyme is used to cut within 
the introns upstream of the kappa and gamma-1 
15 coding regions. The rsT4 sequence is then joined to 
the kappa or gamma-1 constant region sequence, such 
that the rsT4 intron sequence is contiguous with the 
gamma-1 and kappa introns. In this way, an acceptor 
splice site is provided by the kappa or gamma-1 
20 constant region intron. Alternatively, rsT4 chimeric 
genes may be constructed without the use of introns, 
by fusing a suitable rsT4 cDNA gene segment directly 
to the gamma-1 or kappa coding regions. 

The rsT4/C gamna ^ 1 and rsT4/C kappa vectors 
25 may then be co trans fected, for example, by electro- 
poration into lymphoid or non- lymphoid host cells. 
Following transcription and translation of the two 
chimeric genes, the gene products may assemble into 
chimeric antibody molecules. 
30 Expression of the chimeric gene products 

may be measured by an enzyme-linked immunoadsorbant 
assay (ELISA) that utilizes monoclonal anti-T4 anti- 
body OKT4A, as described infra, or in gp!20 competi- 
tion assays and radioimmunoassays , as described infra. 
35 Activity of the rsT4/IgG 1 chimeras may be measured 
by incubating them with HIV-infected cells in the 
presence of human complement, followed by quantitating 
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subseguent complement-mediated lyr\s of these cells. 
Alternatively, activity may be measured in HIV repli- 
cation and HIV syncytium assays as described infra. 

In order to determine if bivalent rsT4 has 
5 a greater potency than monovalent rsT4 , we mixed 
OKT4, at various concentrations, together with a 
constant concentration of rsT4, so that the molar 
ratio of OKT4 ; rsT4 varied between 0.2 and 4. After 
preincubating the mixture overnight at 4°C, we added 

X0 aliquots to the HIV syncytium assay described infra. 

OKT4 has no observable effect in this assay when used 
alone. In addition, the concentration of recombinant 
soluble T4 chosen did not cause inhibition in this 
assay. Accordingly, we looked for indications that 

IS the OKT4/rsT4 mixture was more potent than rsT4 alone. 
We observed that at ratios of OKT4:rsT4 greater than 
0.2, partial to complete inhibition of syncytium 
formation occurred. We believe that under conditions 
where two rsT4 molecules are bound to 1 OKT4 molecule, 

20 the greatest inhibitory effect should be found. 

Thus, polyvalent, as well as monovalent 
forms of recombinant soluble T4 are useful in the 
compositions and methods of this invention. 

Microorganisms and recombinant DNA mole- 

25 cules prepared by the processes of this invention 

are exemplified by cultures deposited in the In Vitro 
International, Inc. culture collection, in Linthicum, 
Maryland, on September 2, 1987 , and identified as: 





BG378: 


E. 


coli 


MC1061/pBG378 


30 


199-7; 


E. 


coli 


MC1061/pl99-7 




170-2: 


E. 


coli 


JA221/pl70-2 




EC100: 


E. 


coli 


JM83/pEC100 




BG377: 


E. 


coli 


MC1061/pBG377 




BG380. 


E. 


coli 


MC1061/pBG380 


35 


BG381 


E. 


coli 


MC1061/pBG381 



These cultures were assigned accession 
numbers IVI 10143-10149. respectively. 
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In addition, microorganisms and recombinant: 
DNA molecules according to this invention are exempli- 
fied by cultures deposited in the In Vitro Interna- 
tional, Inc. culture collection, in Linthicum, 
5 Maryland, on January 6, 1988, and identified as: 



BG-391: 


E- 


coli 


MC1061/pBG391 


BG-392 : 


E. 


coli 


MC1061/pBG392 


BG-393: 


E. 


coli 


HC1061/pBG393 


BG-394: 


E. 


coli 


MC1061/pBG394 


BG-396: 


E. 


coli 


MC1061/pBG396 


203-5 : 


E. 


coli 


SG936/p203-5. 



These cultures were assigned accession 
numbers IVI 10151-10156, respectively, 
i Microorganisms and recombinant DNA mole- 

15 cules according to this invention are also exempli- 
fied by cultures deposited in the In Vitro 
International, Inc. culture collection, in Linthicum, 
Maryland, on August 24, 1988 and identified as: 
211-11: E.coli A89/pBG211-ll 
20 214-10: E.coli A89/pBG214-10 

215-7 : E.coli A89/pBG215-7 

These cultures were assigned accession 
numbers IVI 10183-10185 respectively. 

While we have hereinbefore described a 
25 number of embodiments of this invention, it is 

apparent that our basic constructions can be altered 
to provide othe embodiments which utilize the pro- 
cesses and compositions of this invention. There- 
fore, it will be appreciated that the scope of this 
30 invention is to be defined by the claims appended 

hereto rather than by the specific embodiments which 
have been presented hereinbefore by way of example. 
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C LAIMS 

We claim: 

1. A DNA sequence selected from the group 
consisting of: 

(a) the DNA inserts of pl99-7, pBG377. 
PBG380, pBC381, p203-S. pBG391. pBG392. pBC393. pBC394, 
pBG395, pBG396, pBG397. -211-11, p214-10 and p21S-7; 

(b) DNA sequences which hybridize 

to one or more of the foregoing DNA inserts and which 
code on expression for a soluble T4-like polypeptide; 
and 

(c) DNA sequences which code on 
expression for a soluble T4-like polypeptide coded 
for on expression by any of the foregoing DNA inserts 
and sequences. 

2. The DNA sequence according to claim 1, 
wherein said DNA sequence (b) codes on expression 
for a soluble T4-like polypeptide which inhibits 
adhesion between T4* lymphocytes and infective agents 
which target T4* lymphocytes and which inhibits 
interaction between T4* lymphocytes and antigen pre- 
senting cells and targets of T4* lymphocyte mediated 
killing. 

3. A recombinant DNA molecule comprising 
a DNA sequence selected from the group consisting of 
the DNA sequences of claim 1 or 2, said DNA sequence 
being operatively linked to an expression control 
sequence in said recombinant DNA molecule. 

4. The recombinant DNA molecule according 
to claim 3, wherein said expression control sequence 
is selected from the group consisting of the early 
or late promoters of SV40 or adenovirus, the lac 
system, the trp system, the TAC system, the TRC 
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system, the major operator and promoter regions of 
phage k, the control regions of fd coat protein, the 
promoter for 3-phosphoglycerate kinase or other 
glycolytic enzymes, the promoters of acid phosphatase, 
5 the polyhedron promoter of the baculovirus system 
and the promoters of the yeast o -mating factors. 

5. A unicellular host transformed with a 
recombinant DNA molecule selected from the group 
consisting of the recombinant DNA molecules of claim 3 

10 or 4. 

6. The host according to claim 5, wherein 
said host is selected from the group consisting of 
strains of E.coli , Pseudomonas , Bacillus , 

Strep tomyces , fungi, animal cells, plant cells, 
15 insect cells and human cells in tissue culture. 

7. A polypeptide coded for on expression 
by a DNA sequence selected from the group consisting 
of the DNA sequences of claim 1 or 2, said polypep- 
tide being essentially free of other proteins of 

20 human origin. 

8. The polypeptide according to claim 7, 
wherein said polypeptide is selected from the group 
consisting of a polypeptide of the formula AA _23~ AA 362 
of Figure 3, a polypeptide of the formula ^2.-362 af 

25 Figure 3, a polypeptide of the formula Met-AA i-362 

of Figure 3, a polypeptide of the formula ^^-374 °^ 
Figure 3, a polypeptide of the formula Met-AA 1-374 
of Figure 3, a polypeptide of the formula AA 1 _377 of 
Figure 3, a polypeptide of the formula Met-AA 1 _ 377 

30 of Figure 3, a polypeptide of the formula AA _23~ AA 374 
of Figure 3, a polypeptide of the formula AA _ 2 3~ AA 377 
of Figure 3. 
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9. The polypeptide accrrding to claim 7 , 
wherein said polypeptide is selected fi^m the 
group consisting of a polypeptide of the formula 
AA _23~ AA 182 °* Fi 9 ure 16 » a polypeptide of the 
5 formula AA i~ AA i82 of Figure 16, a polypeptide of 

the formula Met-AA 1-182 of Figure 16, a polypeptide 
of the formula AA_ 23 -AA 1Q2 of Figure 16, followed 
by the amino acids asparagine-leucine-glutamine- 
histidine-serine-leucine, a polypeptide of the formula 

LQ AA 1 -AA 1Q2 of Figure 16 , followed by the amino acids 

asparagine-leucine-glutamine-histidine-serine-leucine, 
a polypeptide of the formula Met-AA 1-182 of Figure 16/ 
followed by the amino acids asparagine-leucine- 
glutamine-histidine-serine-leucine, a polypeptide of 

15 the formula AA_ 23 -AA 113 of Figure 16 , a polypeptide 
of the formula AA 1 -AA 113 of Figure 16 , a polypeptide 
of the formula Met-AA 1-113 of Figure 16, a polypeptide 
of the formula AA_ 2 3 — AA^ ^ ^ of Figure 16, a polypeptide 
of the formula ^x^^lll of Fi 9**re 16, a polypeptide 

20 of the formula ^^"^1^x11 of ^i^ 11 " 16 , a polypep- 
tide of the formula AA_ 23 ~AA 131 of Figure 16, a poly- 
peptide of the formula AA 1 -AA 131 of Figure 16, a 
polypeptide of the formula Met-AA 1 _^ 31 of Figure 16, 
a polypeptide of the formula AA_ 23 -AA 145 of Figure 16, 

25 a polypeptide of the formula AA X -AA 145 of Figure 16, 

r polypeptide of the formula Met-AA 1-145 of Figure 16, 
a polypeptide of the formula ^.23*^166 o£ n?*** 16 , 
a polypeptide of the formula AA 1 -AA 166 of Figure 16, 
a polypeptide of the formula Met-AA^_ lfi g of Figure 16, 

30 or portions thereof. 

10. The polypeptide according to claim 7, 
wherein said polypeptide is selected from the group 
consisting of a polypeptide of the formula AA _23~ AA 362 
of mature T4 protein, a polypeptide of the formula 
35 ^1-362 °^ mat:ure T4 protein, a polypeptide of the 
^^"^ignula M et -AA 1 .352 of mature T4 protein, a polypep- 
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tide of the formula AA X _ 374 of m cure T4 protein, a 
polypeptide of the formula Met-AA i_374 of mature T4 
protein, a polypeptide of the formula AA 1-377 of 
mature T4 protein, a polypeptide of the formula 
5 Met-AA 1-377 of mature T4 protein, a polypeptide of 

the formula AA_ 23 ~AA 374 of mature T4 protein, a poly- 
peptide of the formula AA_ 23 ~AA 377 of mature T4 pro- 
tein, or portions thereof. 

11. The polypeptide according to claim 7, 

10 wherein said polypeptide is selected from the group 

consisting of a polypeptide of the formula AA_ 23 -AA 1Q2 
of mature T4 protein, a polypeptide of the formula 
AA 1 -AA 182 of mature T4 protein, a polypeptide of the 
formula Met-AA 1 _ 1Q2 of mature T4 protein, a polypep- 

15 tide of the formula AA_ 23 -AA 182 of mature T4 protein, 
followed by the amino acids asparagine-leucine- 
glutamine-histidine-serine-leucine, a polypeptide of 
the formula AA 1 -AA 1Q2 of mature T4 protein, followed 
by the amino acids asparagine-leucine-glutamine- 

20 histidine-serine-leucine, a polypeptide of the formula 
Met-AA 1-182 of mature T4 protein, followed by the 
. amino acids asparagine-leucine-glutamine-histidine- 
serine-leucine, a polypeptide of the formula 
AA_ 23 -AA 113 of mature T4 protein, a polypeptide of 

25 the formula AA~-AA 113 of mature T4 protein, a polypep- 
tide of the formula Met-AA^^^ of mature T4 protein, 
a polypeptide of the formula AA .23 -AA 111 of mature 
T4 protein, a polypeptide of the formula ^i^^lll 
of mature T4 protein, a polypeptide of the formula 

30 Met-AA 1 _ 111 of mature T4 protein, a polypeptide of 

the formula AA «23"" AA 131 o£ mature T4 protein, a poly- 
peptide of the formula ^1*^131 of mature T4 protein, 
a polypeptide of the formula Met-AA 1 ^ 131 of mature 
T4 protein, a polypeptide of the formula AA_ 23 -AA 145 

35 of mature T4 protein, a polypeptide of the formula 

AA 1 -AA 145 of mature T4 protein, a polypeptide of the 
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formula Met-AA^^g of mature T4 protein, a polypep- 
tide of the formula AA _23~ AA 166 of mature T4 protein, 
a polypeptide of the formula AA^-AA^^ of mature T4 
protein, a polypeptide of the formula Met-AA 1-166 of 
5 mature T4 protein, or portions thereof. 

12. A method for producing a polypeptide 
selected from the group consisting of the polypeptides 
of any one of claims 7 to 11 comprising the step of 
culturing a unicellular host transformed with a recom- 

10 binant DNA molecule selected from the group consisting 
of the recombinant DNA molecules of claim 3 or 4. 

13. A pharmaceutical composition comprising 
an immuno therapeutic or immunosuppressive effective 
amount of a polypeptide selected from the group con- 

15 sisting of the polypeptides of any one of claims 7 to 
11 and a phaxmaceutlcally acceptable carrier. 



14. A method for treating patients com- 
prising the step of treating them in a pharmaceuti- 
cal!^ acceptable manner with a composition selected 

20 from the group consisting of the composition of 
claim 13. 

15. The method according to claim 14, 
wherein the patient is treated by intramuscular 
injection of the composition. 

25 16. A diagnostic composition for detecting 

or- for monitoring the course of HIV infection com- 
prising a diagnostic effective amount of a polypeptide 
selected from the group consisting of the polypeptides 
of any one of claims 7 to 11. 

30 17 . A method for detecting or for moni- 

toring the course of HIV infection comprising the 
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step of employing as - diagnosti - a composition 
selected from the group consisting or the compositions 
of claim 16. 

18. A means for detecting or for monitoring 
the course of HIV infection comprising a composition 
selected from the group consisting of the compositions 
of claim 16. 

19. A pharmaceutical composition compris- 
ing an immunotherapeutic or immunosuppressive effec- 
tive amount of antibody to a polypeptide selected 
from the group consisting of the polypeptides of any 
one of claims 7 to 11 and a pharmaceutical^ accept- 
able carrier. 

20. A method for treating patients com- 
prising the step of treating them in a pbarmaceuti- 
cally acceptable manner with a composition according 
to claim 19. 

21. The use of a polypeptide selected 
from the group consisting of the polypeptides of any 
one of claims 7 to 11 to purify HIV virus. 

22. The use according to claim 20, wherein 
the HIV virus is purified from a biological sample. 

23. A method for purifying HIV virus from 
a sample comprising the step of exposing the sample 

25 to a polypeptide selected from the group consisting 
of the polypeptides of any one of claims 7 to 11. 

24. The method according to claim 22. 
wherein the sample is a biological sample. 



IS 
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25. A DNA sequence com- rising the DNA 
insert of p!70-2, said sequence coding on expression 
for a T4-like polypeptide. 

26. A recombinant DNA molecule comprising 
S a DNA sequence selected from the group consisting of 

the DNA sequence of claim 25, said DNA sequence 
being opera tively linked to an expression control 
sequence in said recombinant DNA molecule. 

27. A unicellular host transformed with a 
10 recombinant DNA molecule according to claim 26 . 

28. A polypeptide coded for on expression 
by a DNA sequence of claim 25/ said polypeptide being 
essentially free of other proteins of human origin. 

29. A pharmaceutical composition comprising 
15 an immuno therapeutic or immunosuppressive amount of a 

soluble protein receptor and a pharmaceutical ly 
acceptable carrier. 

30. A method for treating patients 
comprising the step of treating them in a phanna- 

20 ceutically acceptable manner with a pharmaceutical 
composition of claim 29. 

31. A diagnostic composition for detecting 
or for monitoring the course of viral infection com- 
prising a diagnostic effective amount of a soluble 

25 protein receptor. 

32. A method for detecting or for moni- 
toring the course of a viral infection comprising 
the step of employing as a diagnostic a diagnostic 
effective amount of a soluble protein receptor. 
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33. A means for detecting or for moni- 
toring the course of a viral infecticn comprising 
a soluble protein receptor. 

34. A DNA sequence selected from the group 
consisting of: 

(a) the DNA insert of pBiv.l; 

(b) DNA sequences which hybridize to 
the DNA insert of pBiv.l and which code on expression 
for a polyvalent soluble T4-like polypeptide; and 

(c) DNA sequences which code on 
expression for a polyvalent soluble T4-like polypep- 
tide coded for by the DNA insert of pBiv.l. 

35. A recombinant DNA molecule comprising 
a DNA sequence selected from the group consisting of 
the DNA sequences of claim 34, said DNA sequence 
being operatively linked to an expression control 
sequence in said recombinant DNA molecule. 

36. A unicellular host transformed with a 
recombinant DNA molecule according to claim 35. 

37. A polypeptide coded for on expression 
by a DNA sequence selected from the group consisting 
of the DNA sequences according to claim 34, said 
polypeptide being essentially free of other proteins 
of human origin. 

38. The polypeptide according to claim 7, 
wherein said polypeptide is polyvalent. 

39. A method for producing a polyvalent 
polypeptide comprising the steps of: 

(a) culturing a unicellular host 
transformed with a recombinant DNA molecule according 
to claim 3 or 4 to produce a polypeptide; and 
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(b) coupling saia polypeptide to a 
carrier to form* a polyvalent polypeptide. 

40. A DNA sequence comprising: 

(a) a first portion comprising a DNA 
sequence coding for the constant region of an immuno- 
globulin light chain; and 

(b) a second portion comprising a 
DNA sequence according to claim i or ?, or portions 
thereof, said second portion being joined upstream 
of said first portion. 

41. A DNA sequence comprising: 

(a) a first portion comprising a DNA 
sequence coding for the constant region of an immuno- 
globulin heavy c h a i n; and 

(b) a second portion comprising a 
DNA sequence according to claim 1 or 2, or portions 
thereof , said second portion being joined upstream 
of said first portion. 

42. An expression vector comprising the 
DNA sequence according to claim 40. 

43. An expression vector comprising the 
DNA sequence according to claim 41. 

44. An expression vector comprising the 
DNA sequence according to claim 40 and the DNA 
sequence according to claim 41. 

45. A method for producing a chimeric 
rsT4/IgG 1 comprising the step of co-transfecting 
host cell with the expression vector according to 
claim 42 and the expression vector according to 
claim 43. 
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46. A method for producing a chimeric 
rsT4/IgG 1 comprising the step of transfecting a host 
cell with the expression vector according to claim 44, 

47. A chimeric rsT4/IgG 1 produced by the 
method according to claim 45 or 46. 

48. A pharmaceutical composition comprising 
an immunotherapeutic or immunosuppressive effective 
amount of a polypeptide according to claim 37 or 38. 

49. A method for treating patients com- 
prising the step of treating them in a pharmaceutic 
cally acceptable manner with a composition according 
to claim 48. 

50. A diagnostic composition for detecting 
or for monitoring the course of HIV infection com* 
prising a diagnostic effective amount of a polypeptide 
according to claim 37 or 38. 

51. A pharmaceutical composition comprising 
an immunotherapeutic or immunosuppressive effective 
amount of a chimeric rsT4/IgG 1 according to claim 47. 

52. A method for treating patients com- 
prising, the step of treating them in a pharmaceutic 
cally acceptable manner with a composition according 
to claim 51. 
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101 GTCAGCAACC AGGTGTGGAA AGTCCCCAGG CTCCCCAGCA GGCAGAAGTA 
151 TGCAAAGCAT GCATCTCAAT TAGTCAGCAA CCATAGTCCC GCCCCTAACT 
201 CCGCCCATCC CGCCCCTAAC TCCGCCCAGT TCCGCCCATT CTCCGCCCCA 
251 TGGCTGACTA ATTTTTTTTA TTTATGCAGA GGCCGAGGCC GCCTCGGCCT 
301 CTGAGCTATT CCAGAAGTAG TGAGGAGGCT TTTTTGGAGG GGTCCTCCTC 
351 GTATAGAAAC TCGGACCACT CTGAGACGAA GGCTCGCGTC CAGGCCAGCA 
401 CGAAGGAGGC TAAGTGGGAG GGGTAGCGGT CGTTGTCCAC TAGGGGGTCC 
451 ACTCGCTCCA GGGTGTGAAG ACACATGTCG CCCTCTTCGG CATCAAGGAA 
501 GGTGATTGGT TTATAGGTGT AGGCCACGTG ACCGGGTGTT CCTGAAGGGG 
551 GGCTATAAAA GGGGGTGGGG GCGCGTTCGT CCTCACTCTC TTCCGCATCG 
601 CTGTCTGCGA GGGCCAGCTG TTGGGCTCGC GGTTGAGGAC AAACTCTTCG 
651 CGGTCTTTCC AGTACTCTTG GATCGGAAAC CCGTCGGCCT CCGAACGGTA 
701 CTCCGCCACC GAGGGACCTG AGCGAGTCCG CATCGACCGG ATCGGAAAAC 
751 CTCTCGAGAA AGGCGTCTAA CCAGTCACAG TCGCAAGGTA GGCTGAGCAC 
801 CGTGGCGGGC GGCAGCGGGT GGCGGTCGGG GTTGTTTCTG GCGGAGGTGC 
851 TGCTGATGAT GT A ATT A A AG TAGGCGGTCT TGAGACGGCG GATGGTCGAG 
901 GTGAGGTGTG GCAGGCTTGA GATCGATCTG GCCATACACT TGAGTGACAA 
951 TGACATGCAC TTTGCCTTTC TCTCCACAGG TGTCCACTCC CAGGTCCAAC 
1001 TGGATCCAAG CTTCGACTCG AGGAATTCCC CGAAGGAACA AAGCACCCTC 
1051 CCCACTGGGC TCCTGGTTGC AGAGCTCCAA GTCCTCACAC AGATACGCCT 
1101 GTTTGAGAAG CAGCGGGCAA GAAAGACGCA AGCCCAGAGG CCCTGCCATT 
1151 TCTGTGGGCT CAGGTCCCTA CTGGCTCAGG CCCCTGCCTC CCTCGGCAAG 
1201 GCCACAATGA ACCGGGGAGT cccttttagg cacttgcttc tggtgctgca 
1251 ACTGGCGCTC CTCCCAGCAG CCACTCAGGG AAAGAAAGTG GTGCTGGGCA 
1301 AAAAAGGGGA TACAGTGGAA CTGACCTGTA CAGCTTCCCA GAAGAAGAGC 
135 X ATACAATTCC ACTGGAAAAA CTCCAACCAG ATAAAGATTC TGGGAAATCA 
X4UI GGGCTCCTTC TTAAC"WA4GoQ T £CA3;CCAA GCTGAATGAT CGCGCTGACT 
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1501 CTTAAGATAG AAGACTCAGA TACTTACATC TGTGAAGTGG AGGACCAGAA 
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1651 AGTAGCCCCT CAGTGCAATG TAGGAGTCCA AGGGGTAAAA ACATACAGGG 

1701 GGGGAAGACC CTCTCCGTGT CTCAGCTGGA GCTCCAGGAT AGTGGCACCT 

1751 GGACATGCAC TGTCTTGCAG AACCAGAACA AGGTGGAGTT CAAAAT AGAC 

1801 ATCGTGGTGC TAGCTTTCCA GAAGGCCTCC AGCATAGTCT ATAAGAAAGA 

1851 GGGGGAACAG GTGGAGTTCT CCTTCCCACT CGCCTTTACA GTTGAAAAGC 

1901 TGACGGGCAG TGGCGAGCTG TGGTGGCAGG CGGAGAGGGC TTCCTCCTCC 

195 3 AAGTCTTGGA TCACCTTTGA CCTGAAGAAC AAGGAAGTGT CTGTAAAACG 

2001 GGTTACCCAG GACCCTAAGC TCCAGATGGG CAAGAAGCTC CCGCTCCACC 

2051 TCACCCTGCC CCAGGCCTTG CCTCAGTATG CTGGCTCTGG AAACCTCACC 

2101 CTGGCCCTTG AAGCGAAAAC AGGAAAGTTG CATCAGGAAG TGAACCTGGT 

2151 GGTGATGAGA GCCACTCAGC TCCAGAAAAA TTTGACCTGT GAGGTGTGGG 

2201 GACCCACCTC CCCTAAGCTG ATGCTGAGTT TGAAACTGGA GAACAAGGAG 

2 251 GCAAAGGTCT CGAAGCGGGA GAAGGCGGTG TGGGTGCTGA ACCCTGAGGC 
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2501 TTAAGTGTAT AATGTGTTAA ACTACTGATT CTAATTGTTT GTGTATTTTA 

2551 GATTCCAACC TATGGAACTG ATGAATGGGA GCAGTGGTGG AATGCCTTTA 

2601 ATGAGGA A A A CCTGTTTTGC TCAGAAGAAA TGCCATCTAG TGATGATGAG 

2651 GCTACTGCTG ACTCTCAACA TTCTACTCCT CCAAA AAAGA AGAGAAAGGT 

2701 AGAAGACCCC AAGGACTTTC CTTCAGAATT GCTAAGTTTT TTGAGTCATG 

2751 CTGTGTTTAG TAATAGAACT CTTGCTTGCT TTGCTATTTA CACCACAAAG 

2801 GAAAAAGCTG CACTGCTATA CAAGAAAATT ATGGAAAAAT ATTCTGTAAC 

2851 CTTTATAAGT AGGCATAACA GTTATAATCA TAACATACTG TTTTTTCTTA 

2901 CTCCACACAG GCATAGAGTG TCTGCTATTA ATAACTATGC TCAAAAATTG 

2951 TGTACCTTTA GCTTTTT AAT TTGTAAAGGG GTTAATAAGG AATATTTGAT 

3001 GTATAGTGCC TTGACT AGAG ATC ATAATCA GCCATACCAC ATTTGTAGAG 

305 1 GTTTTACTTG CTTTA^4|ftQtCgQCfiACAC CTCCQCCTGA ACCTGAAACA 



PC I /L' 588/02940 



w» Fl3 , 5 / cont ' d ) 

3101 T A A A ATCA A T GCAATTGTTG TTGTTAACTT GTTTATTGCA GCTTAT AATG 

31S1 GTT A C A A AT A AAGCAATAGC ATCACAAATT tcacaaataa AGCATTTTTT 

3201 TCACTGCATT CTAGTTGTGG TTTGTCCAAA CTCATCAATG TATCTTATCA 

3251 TGTCTGGATC CTCTACGCCG GACGCATCGT GGCCGGCATC ACCGGCCCCA 

3301 CAGGTGCGGT TGCTGGCGCC TATATCGCCG ACATC ACCGA TGGGGAAGAT 

3351 CGGGCTCGCC ACTTCGGGCT CATGAGCGCT TGTTTCGGCG TGGGTATGGT 

3401 GGCAGGCCCG TGGCCGGGGG ACTGTTGGGC GCCATCTCCT TGCATGCACC 

3451 ATTCCTTGCG GCGGCGGTGC TCAACGGCCT CAACCTACTA CTGGGCTGCT 

3501 TCCTAATGCA GGAGTCGCAT AAGGGAGAGC GTCGACCGAT GCCCTTGAGA 

3551 GCCTTCAACC CAGTCAGCTC CTTCCGGTGG GCGCGGGGCA TGACTATCGT 

3601 CGCCGCACTT ATGACTGTCT TCTTTATCAT GCAACTCGTA GGACAGGTGC 

3651 CGGCAGCGCT CTGGGTCATT TTCGGCGAGG ACCGCTTTCG CTGGAGCGCG 

3701 ACGATGATCG GCCTGTCGCT TGCGGTATTC GGAATCTTGC ACGCCCTCGC 

3751 TCAAGCCTTC GTCACTGGTC CCGCCACCAA ACGTTTCGGC GAGAAGCAGG 

3801 CCATTATCGC CGGCATGGCG GCCGACGCGC TGGGCTACGT CTTGCTGGCG 

3851 TTCGCGACGC GAGGCTGGAT GGCCTTCCCC ATTATGATTC TTCTCGCTTC 

3901 CGGCGGCATC GGGATGCCCG CGTTGCAGGC CATGCTGTCC AGGCAGGTAG 

3951 ATGACGACCA TCAGGGACAG CTTCAAGGAT CGCTCGCGGC TCTTACCAGC 

4001 CTAACTTCGA TCACTGGACC GCTGATCGTC ACGGCGATTT ATGCCGCCTC 

4051 GGCGAGCACA TGGAACGGGT TGGCATGGAT TGTAGGCGCC GCCCTATACC 

4X01 TTGTCTGCCT CCCCGCGTTG CGTCGCGGTG CATGGAGCCG GGCCACCTCG 

4 151 ACCTGAATGG AAGCCGGCGG CACCTCGCTA ACGGATTCAC CACTCCAAGA 

4201 ATTGGAGCCA ATCAATTCTT GCGGAGAACT GTGAATGCGC AAACCAACCC 

4251 TTGGCAGAAC AT ATCCATCG CGTCCGCCAT CTCCAGCAGC CGCACGCGGC 

4301 GCATCTCGGG CCGCGTTGCT GGCGTTTf TC CATAGGCTCC GCCCCCCTGA 
4351 CGAGCATCAC AAAAATCGAC GCTCAAGTCA GAGGTGGCGA AACCCGACAG 

4401 GACTATAAAG ATACCAGGCG TTTCCCCCTG GAAGCTCCCT CGTGCGCTCT 
4451 CCTGTTCCGA CCCTGCCGCT TACCGGATAC CTGTCCGCCT TTCTCCCTTC 
4501 GGGAAGCGTG GCGCTTTCTC AATGCTCACG CTGTAGGTAT CTCAGTTCGG 
4551 TGTAGGTCGT TCGCTCCAAG CTGGGCTGTG TGCACGAACC CCCCGTTCAG 
4601 CCCGACCGCT GCGCCTTATC CGGTAACTAT CGTCTTGAGT CCAACCCGGT 
4651 AAGACACGAC TTATCGCCAC TGGCAGCAGC CACTGGTAAC AGGATTAGCA 
4701 GAGCGAGGTA TGTAG g ( ^^ T ^6j C ^^ C ^ CAGT TCTTGAAGTG GTGGCCTAAC 
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F I G. l5(cont'd) 



4751 


TACGCCTACA 


ctagaaggac 


AGTATTTGGT 


ATCTGCGCTC 


TGCTGAAGCC 


4801 


AGTTACCTTC 


GGaaaaaGAG 


TTGGTAGCTC 


TTGATCCGGC 


AAACAAACCA 


4851 


CCGCTCGTAG 


cggtggtttt 


TTTGTTTGCA 


AGCAGCAGAT 


TACGCGCAGA 


4901 


AAAAAAGGAT 


ctcaagaaga 


TCCTTTGATC 


TTTTCTACGG 


GGTCTGACGC 


4951 


TCAGTGGAAC 


Gaaaactcac 


GTTAAGGGAT 


TTTGGTCATG 


AGATTATCAA 


5001 


AAAGGATCTT 


cacctagatc 


CTTTTAAATT 


AAA AATGAAG 


TTTTAAATCA 


5051 


ATCTAAAGTA 


tatatgagta 


AACTTGGTCT 


GACAGTTACC 


AATGCTTAAT 


5101 


CAGTGAGGCA 


CCTATCTCAG 


CGATCTGTCT 


ATTTCGTTCA 


TCCATAGTTG 


5151 


CCTGACTCCC 


CGTCGTGTAG 


ATAACTACGA 


TACGGGAGGG 


CTTACCATCT 


5201 


GGCCCCAGTG 


CTGCAATGAT 


ACCGCGAGAC 


CCACGCTCAC 


CGGCTCCAGA 


5251 


TTTATCAGCA 


ATAAACCAGC 


CAGCCGGAAG 


GGCCGAGCGC 


A G A A GTGGT C 


5301 


CTGCAACTTT 


ATCCGCCTCC 


ATCCAGTCTA 


TTAATTGTTG 


CCGGGAAGCT 


5351 


AGAGTAAGTA 


GTTCGCCAGT 


TAATAGTTTG 


CGCAACGTTG 


TTGCCATTGC 


5401 


TGCAGGCATC 


GTGGTGTCAC 


GCTCGTCGTT 


TGGTATGGCT 


TCATTCAGCT 


5451 


CCGGTTCCCA 


ACGATCAAGG 


CGAGTTACAT 


GATCCCCCAT 


GTTGTGCAAA 


5501 


A A AGCGGTT A 


GCTCCTTCGG 


TCCTCCGATC 


GTTGTCAGAA 


GTAAGTTGGC 


5551 


CGCAGTGTTA 


TCACTCATGG 


TTATGGCAGC 


ACTGCATAAT 


TCTCTTACTG 


5601 


TCATGCCATC 


CGTAAGATGC 


TTTTCTGTGA 


CTGGTGAGTA 


CTCAACCAAG 


5651 


TCAT7CTGAG 


AATAGTGTAT 


GCGGCGACCG 


AGTTGCTCTT 


GCCCGGCGTC 


5701 


AACACGGGAT 


AATACCGCGC 


CACATAGCAG 


AACTTTAAAA 


GTGCTCATCA 


5751 


TTGGAAAACG 


TTCTTCGGGG 


CGAAAACTCT 


CAAGGATCTT 


ACCGCTGTTG 


5601 


AGATCCAGTT 


CGATGTAACC 


CACTCGTGCA 


CCCAACTGAT 


CTTCAGCATC 


5851 


TTTTACTTTC 


ACCAGCGTTT 


CTGGGTGAGC 


AAAAACAGGA 


AGGCAAAATG 


5901 


CCGC A A A A A A 


GGGAATAAGG 


GCGACACGGA 


AATGTTGAAT 


ACTCATACTC 


5951 


TTCCTTTTTC 


AATATTATTG 


A AGCATTT AT 


CAGGGTTATT 


GTCTCATGAG 


6001 


CGGATACATA 


TTTGAATGTA 


TTTAGAAAAA 


TAAACAAATA 


GGGGTTCCGC 


6051 


GCACATTTCC 


CCGAAAAGTG 


CCACCTGACG 


TCTAAGAAAC 


CATTATTATC 


6101 

6151 
3805 


atga C a tt a a 

A 


CCTATAAAAA T AGGCGTATC 

89085519 


ACGAGGCCCT 


TTCGTCTTCA 
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P36392 

:BG368 backbont — » / / 
siolubtt T4#7 f~ I (j 

:AA #3 » LVS 
: 162AA*6AA 

: f rom 203-5 

bo392.seq Length: 6149 

1 CAATTAATTC CAGCTTCCTG TCCAATCTGT GTCAGTTAGG GTGTGGAAAG 

51 TCCCCAGGCT CCCCAGCAGG CAGAAGTATG CAAAGCATGC ATCTCAATTA 

101 GTCAGCAACC AGGTGTGGAA AGTCCCCAGG CTCCCCAGCA GGCAGAAGTA 

151 TGCAAAGCAT GCATCTCAAT TAGTCAGCAA CCATAGTCCC GCCCCTAACT 

201 CCGCCCATCC CGCCCCTAAC TCCGCCCAGT TCCGCCCATT CTCCGCCCCA 

251 TGGCTGACTA A TTTTTTTT A TTTATGCAGA GGCCGAGGCC GCCTCGGCCT 

301 CTGAGCTATT CCAGAAGTAG TGAGGAGGCT TTTTTGGAGG GGTCCTCCTC 

351 GTATAGAAAC TCGGACCACT CTGAGACGAA GGCTCGCGTC CAGGCCAGCA 

401 CGAAGGAGGC TAAGTGGGAG GGGTAGCGGT CGTTGT C C A C TAGGGGGTCC 

451 ACTCGCTCCA GGGTGTGAAG ACACATGTCG CCCTCTTCGG CATCAAGGAA 

501 GGTGATTGGT TTATAGGTGT AGGCCACGTG ACCGGGTGTT CCTGAAGGGG 

551 GGCTATAAAA GGGGGTGGGG GCGC6TTCGT CCTCACTCTC TTCCGCATCG 

601 CTGTCTGCGA GGGCCAGCTG TTGGGCTCGC GGTTGAGGA C AAACTCTTCG 

651 CGGTTTTCC AGTACTCTTG GATCGGAAAC CCGTCGGCCT CCGAACGGTA 

701 CTCCGCCACC GAGGGACCTG AGCGAGTCCG CATCGACCGG ATCGGAAAAC 

751 CTCTCGAGAA AGGCGTCTAA CCAGTCACAG TCGCAAGGTA GGCTGAGCAC 

801 CGTGGCGGGC GGCAGCGGGT GGCGGTCGGG GTTGTTTCTG GCGGAGGTGC 

851 TGCTGATGAT GT A ATT A A AG TAGGCGGTCT TGAGACGGCG GATGGTCGAG 

901 GTGAGGTGTG GCAGGCTTGA GATCGATCTG GCCATACACT TGAGTGACAA 

951 TGACATCCAC TTTGCCTTTC TCTCCACAGG TGTCCACTCC CAGGTCCAAC 

1001 TGGATCCAAG CTTCGACTCG AGGAATTCCC CGAAGGAACA AAGCACCCTC 

1051 CCCACTGGGC TCCTGGTTGC AGAGCTCCAA GTCCTCACAC AGATACGCCT 

1101 GTTTGAGAAG CAGCGGGCAA GAAAGACGCA AGCCCAGAGG CCCTGCCATT 

1151 TCTGTGGGCT CAGGTCCCTA CTGGCTCAGG CCCCTGCCTC CCTCGGCAAG 
MET 

1201 GCCAC*(ATGk ACCGGGGAGT CCCTTTTAGG CACTTGCTTC TGGTGCTGCA 

Caa-23 r 

1251 ACTGGCGCTC CTCCCAGCAG C C A CTTCAGGG AAAGAAAGTG GTGCTGGGCA 

1301 AAAAAGGGGA TACAGTGC ^^^^^ : ^^ A CAGCTTCCCA GAAGAAGAGC 
1351 ATACAATTCC ACTGGAAAAA CTCCAACCAG ATAAAGATTC TGGGAAATCA 



5//S>3 

1401 GGGCTCCTTC TTAACTAAAG^'CTCCA^CC^A "gCTGAATGAT CGCGCTGACT 

1451 CAAGAAGAAG CTTGTGGGAC CAAGGAAACT TTCCCCTGAT CATCAAGAAT 

1501 CTTAAGATAG AAGACTCAGA TACTTACATC TGTGAAGTGG AGGACCAGAA 

1551 GGAGGAGGTG CAATTGCTAG TGTTCGGATT GACTGCCAAC TCTGACACCC 

1601 ACCTGCTTCA GGGGCAGAGC CTGACCCTGA CCTTGGAGAG CCCCCCTGGT 

1651 AGTAGCCCCT CAGTGCAATG TAGGAGTCCA AGGGGTAAAA ACATACAGGG 

1701 GGGGAAGACC CTCTCCGTGT CTCAGCTGGA GCTCCAGGAT AGTGGCACCT 

1751 GGACATGCAC TGTCTTGCAG AACCAGAAGA AGGTGGAGTT CAAAATAGAC 

STOP 

1801 ATCGTGGTGC TAGCTTTCCA GAACCTCCAG CATAGTCT#(T_AAjGAAAGAGG 

1851 GGGAACAGGT GGAGTTCTCC TTCCCACTCG CCTTTACAGT TGAAAAGCTG 

1901 ACGGGCAGTG GCGAGCTGTG GTGGCAGGCG GAGAGGGCTT CCTCCTCCAA 

1951 GTCTTGGATC ACCTTTGACC TGAAGAACAA GGAAGTGTCT GTAAAACGGG 

2001 TTACCCAGGA CCCTAAGCTC CAGATGGGCA AGAAGCTCCC GCTCCACCTC 

2051 ACCCTGCCCC AGGCCTTGCC TCAGTATGCT GGCTCTGGAA ACCTCACCCT 

2101 GGCCCTTGAA GCGAAAACAG GAAAGTTGCA TCAGGAAGTG AACCTGGTGG 

2151 TGATGAGAGC CACTCAGCTC CAGAAAAATT TGACCTGTGA GGTGTGGGGA 

2201 CCCACCTCCC CTAAGCTGAT GCTGAGTTTG AAACTGGAGA ACAAGGAGGC 

2251 AAAGGTCTCG AAGCGGGAGA AGGCGGTGTG GGTGCTGAAC CCTGAGGCGG 

2301 GGATGTGGCA GTGTCTGCTG AGTGACTCGG GACAGGTCCT GCTGGAATCC 

2351 AACATCAAGG TTCTGCCCAC ATGGTCGACC CCGGTGCAGC CAATGGCCCT 

2401 GATTTGAGAT CTTTGTGAAG GAACCTTACT TCTGTGGTGT GACATAATTG 

2451 GACAAACTAC CTACAGAGAT TTAAAGCTCT AAGGTAAATA TAAAATTTTT 

2501 AAGTGTATAA TGTGTTAAAC TACTGATTCT AATTGTTTGT GTATTTTAGA 

2551 TTCCAACCTA TGGAACTGAT GAATGGGAGC AGTGGTGGAA TGCCTTTAAT 

2601 GAGGAAAACC TGTTTTGCTC AGAAGAAATG CCATCTAGTG ATGATGAGGC 

2651 TACTGCTGAC TCTCAACATT CTACTCCTCC AAAA A AGAAG AGAAAGGTAG 

2701 * AAGACCCCAA GGACTTTCCT TCAGAATTGC TAAGTTTTTT GAGTCATGCT 

2751 GTGTTTAGTA ATAGAACTCT TGCTTGCTTT GCTATTTACA CCACAAAGGA 

2801 AAAAGCTGCA CTGCTATACA AGAAAATTAT GGAAAAATAT TCTGTAACCT 

2851 TTATAAGTAG GCATAACAGT TATAATCATA ACATACTGTT TTTTCTTACT 

2901 CCACACAGGC ATAGAGTGTC TGCTATTAAT AACTATGCTC AAAAATTGTG 

2951 TACCTTTAGC TTTTTA>rTT JfJM^°J? GGT TAATAAGGAA TATTTGATGT 



CAGAT CATAlTt 



3 8 0 *?001 ATACTGCCTT GACTAGAGAT CATA*ftAGC CATACCACAT TTGTAGAGGT 
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3051 TTTACTTGCT TTAAAAAACC TCCCACAC^T CCCCCTGAAC CTGAAACATA 

3101 AAATGAATGC AATTGTTGTT GTTAACTTGT TTATTGCAGC TTATAATGGT 

3151 T AC A A AT AAA GCAATAGCAT CACAAATTTC ACAAATAAAG CATTTTTTTC 

3201 ACTGCATTCT AGTTGTGGTT TGTCCAAACT CATCAATGTA TCTTATCATG 

3251 TCTGGATCCT CTACGCCGGA CGCATCGTGG CCGGCATCAC CGGCGCCACA 

3301 GGTGCGGTTG CTGGCGCCTA TATCGCCGAC ATCACCGATG GGGAAGATCG 

3351 GGCTCGCCAC TTCGGGCTCA TGAGCGCTTG TTTCGGCGTG GGTATGGTGG 

3401 CAGGCCCGTG GCCGGGGGAC TGTTGGGCGC CATCTCCTTG CATGCACCAT 

3451 TCCTTGCGGC GGCGGTGCTC AACGGCCTCA ACCTACTACT GGGCTGCTTC 

3501 CTAATGCAGG AGTCGCATAA GGGAGAGCGT CGACCGATGC CCTTGAGAGC 

3551 CTTCAACCCA GTCAGCTCCT TCCGGTGGGC GCGGGGCATG ACTATCGTCG 

3601 CCGCACTfAT GACTGTCTTC TTTATCATGC AACTCGTAGG ACAGGTGCCG 

3651 GCAGCGCTCT GGGTCATTTT CGGCGAGGAC CGCTTTCGCT GGAGCGCGAC 

3701 GATGATCGGC CTGTCGCTTG CGGTATTCGG AATCTTGCAC GCCCTCGCTC 

3751 AAGCCTTCGT CACTGGTCCC GCCACCAAAC GTTTCGGCGA GAAGCAGGCC 

3801 ATTATCGCCG GCATGGCGGC CGACGCGCTG GGCTACGTCT TGCTGGCGTT 

3851 CGCGACGCGA GGCTGGATGG CCTTCCCCAT TATGATTCTT CTCGCTTCCG 

3901 GCGGCATCGG GATGCCCGCG TTGCAGGCCA TGCTGTCCAG GCAGGTAGAT 

3951 GACGACCATC AGGGACAGCT TCAAGGATCG CTCGCGGCTC TTACCAGCCT 

4001 AACTTCGATC ACTGGACCGC TGATCGTCAC GGCGATTTAT GCCGCCTCGG 

4051 CGAGCACATG GAACGGGTTG GCATGGATTG TAGGCGCCGC CCTATACCTT 

4101 GTCTGCCTCC CCGCGTTGCG TCGCGGTGCA TGGAGCCGGG CCACCTCGAC 

4151 CTGAATGGAA GCCGGCGGCA CCTCGCTAAC GGATTCACCA CTCCAAGAAT 

4201 TGGAGCCAAT CAATTCTTGC GGAGAACTGT GAATGCGCAA ACCAACCCTT 

4251 GGCAGAACAT ATCCATCGCG TCCGCCATCT CCAGCAGCCG CACGCGGCGC 

4301 ATCTCGGGCC GCGTTGCTGG CGTTTTTCCA TAGGCTCCGC CCCCCTGACG 

4351 AGCATCACAA AAATCGACGC TCAAGTCAGA GGTGGCGAAA CCCGACAGGA 

4401 CTATAAAGAT ACCAGGCGTT TCCCCCTGGA AGCTCCCTCG TGCGCTCTCC 

4451 TGTTCCGACC CTGCCGCTTA CCGGATACCT GTCCGCCTTT CTCCCTTCGG 

4501 GAAGCGTGGC GCTTTCTCAA TGCTCACGCT GTAGGTATCT CAGTTCGGTG 

4551 TAGGTCGTTC GCTCCAAGCT GGGCTGTGTG CACGAACCCC CCGTTCAGCC 

4601 CGACCGCTGC GCCTT ^ft^^^ A £If TCG T CTTGAGTCC AACCCGGTAA 

8 08 4651 GACACGACTT ATCGCCACTG GCAGCAGCCA CTGGTAACAG GATTAGCAGA 
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FIG.I6(cont'd) 

4701 GCGAGGTATG TAGGCGGTGC TACAGAGTTC TTGAAGTGGT GGCCTAACTA 

4751 CGGCTACACT AGAAGGACAG TATTTGGTAT CTGCGCTCTG CTGAAGCCAG 

4B01 TTACCTTCGG AAAAAGAGTT GCTAGCTCTT GATCCGGCAA ACAAACCACC 

4851 GCTGGTAGCG GTGGTTTTTT TGTTTGCAAG CAGCAGATTA CGCGCAGAAA 

4901 AAAAGGATCT CAAGAAGATC CTTTGATCTT TTCTACGGGG TCTGACGCTC 

4951 AGTGGAACGA AAACTCACGT TAAGGGATTT TGGTCATGAG ATTATCAAAA 

5001 AGGATCTTCA CCTAGATCCT TTTAAATTAA AAATGAAGTT TTAAATCAAT 

5051 CTAAAGTATA TATGAGTAAA CTTGGTCTGA CAGTTACCAA TGCTTAATCA 

5101 GTGAGGCACC TATCTCAGCG ATCTGTCTAT TTCGTTCATC CATAGTTGCC 

5151 TGACTCCCCG TCGTGTAGAT AACTACGATA CGGGAGGGCT TACCATCTGG 

5201 CCCCAGTGCT GCAATGATAC CGCGAGACCC ACGCTCACCG GCTCCAGATT 

5251 TATCAGCAAT AAACCAGCCA GCCGGAAGGG CCGAGCGCAG AAGTGGTCCT 

5301 GCAACTTTAT CCGCCTCCAT CCAGTCTATT AATTGTTGCC GGGAAGCTAG 

5351 AGTAAGTAGT TCGCCAGTTA ATAGTTTGCG CAACGTTGTT GCCATTGCTG 

5401 CAGGCATCGT GGTGTCACGC TCGTCGTTTG GTATGGCTTC ATTCAGCTCC 

5451 GGTTCCCAAC GATCAAGGCG AGTTACATGA TCCCCCATGT TGTGCAAAAA 

5501 AGCGGTTAGC TCCTTCGGTC CTCCGATCGT TGTCAGAAGT AAGTTGGCCG 

5551 CAGTGTTATC ACTCATGGTT ATGGCAGCAC TGCATAATTC TCTTACTGTC 

5601 ATGCCATCCG TAAGATGCTT TTCTGTGACT GGTGAGTACT CAACCAAGTC 

5651 ATTCTGAGAA TAGTGTATGC GGCGACCGAG TTGCTCTTGC CCGGCGTCAA 

5701 CACGGGATAA TACCGCGCCA CATAGCAGAA CTTTAAAAGT GCT CATC ATT 

5751 GGAAAACGTT CTTCGGGGCG AAAACTCTCA AGGATCTTAC CGCTGTTGAG 

580-1 ATCCAGTTCG ATGTAACCCA CTCGTGCACC CAACTGATCT TCAGCATCTT 

5851 TTACTTTCAC CAGCGTTTCT GGGTGAGCAA AAACAGGAAG GCAAAATGCC 

5901 GCAAAAAAGG GAATAAGGGC GACACGGAAA TGTTGAATAC TCATACTCTT 

5951 CCTTTTTCAA TATTATTQAA GCATTTATCA GGGTTATTGT CTCATGAGCG 

6001 GATACATATT TGAATGTATT TAGAAAAATA AACAAATAGG GGTTCCGCGC 

6051 ACATTTCCCC GAAAAGTGCC ACCJGACGTC TAAGAAACCA TTATTATCAT 




6101 GACATTAACC TATAAAAATA GGCGTATCaC GAGGCCCTTT CGTCTTCAA 
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*-S* CTA CCT TTT CCA CTC A 3' 

FIG. 18 

5' CAT CTC ACT CCA AAC 3* 



63 

5» ceo CTC ATA CTA A 3' 



5' CAT CTT ACT ATC A 3' 



67 

— 5 . C ccccxcACCcrc*cccTCACC^tccAcwca:c3• 

68 

5' CCG CCC CCC TCT CCA ACS TCA CSC TCA CCC TCT C 3 

69 

3* CCS SST AST ASC CCC TCA STS CAA TSA 3* 

70 



CAT CTC ATT CCA CTC ACS CSC TAC TAC 3' 



^S* CCC CCT AST ASC CCC TCA CTC CAA TCT ACS ACT C 3 ' 



72 

3' TAS SAC TCC TAC ATT SCA CTC ACS CCC TAC TAC 3 ' 
"j. cxj^ CSS CTA AAA ACA TAC ACS CCS CCA ASA CCT CA 3' 
^3' CAT CTC ACS TCT TTC CCC CCC TCT ATC TTT TTA CCC 3 ' 

S S' CCA CCA TAC TCC CAC CTC CAC ATC CAC TCT CTT CCA 
CAA CTC A 3' 



"S' CAT CTC AST TCT CCA ACA CAC TCC ATC TCC ACS 
CAC TAT CCT CCA CCT 3' 

38ii 89085519 
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^BG368 Dachoct FIG 19 

: AA 03 = LVS 

:tir$t 113 AA of T4 
roasicaHy uD tu vui 

og394.seq Length: 5365 

1 GAATTAATTC CACCTTGCTG TGGAATGTGT GTCAGTTAGG GTGTGGAAAG 

51 TCCCCAGGCT CCCCAGCAGG CAGAAGTATG CAAAGCATGC ATCTCAATTA 

101 GTCAGCAACC AGGTGTGGAA AGTCCCCAGG CTCCCCAGCA GGCAGAAGTA 

151 TGCAAAGCAT GCATCTCAAT TAGTCAGCAA CCATAGTCCC GCCCCTAACT 

201 CCGCCCATCC CGCCCCTAAC TCCGCCCAGT TCCGCCCATT CTCCGCCCCA 

25: TGGCTGACTA ATTTTTTTTA TTTATGCAGA GGCCGAGGCC GCCTCGGCCT 

301 CTGAGCTATT CCAGAAGTAG TGAGGAGGCT TTTTTGGAGG GGTCCTCCTC 

351 GTATAGAAAC TCGGaCCACT CTGAGACGAA GGCTCGCGTC CAGGCCAGCA 

40 1 CGAAGGAGGC TAAGTGGGAG GGGTAGCGGT CGTTGTCCAC TAGGGGGTCC 

45 1 ACTCGCTCCA GGGTGTGAAG ACACATGTCG CCCTCTTCGG CATCAAGGAA 

50 J GGTGATTGGT TTATAGGTGT AGGCCACGTG ACCGGGTGTT CCTGAAGGGG 

551 GGCTATAAAA GGGGGTGGGG GCGCGTTCGT CCTCACTCTC TTCCGCATCG 

60 1 CTGTCTGCGA GGGCCAGCTG TTGGGCTCGC GGTTGAGGAC AAACTCTTCG 

65 1 CGGTCTTTCC AGTACTCTTG GATCGGAAAC CCGTCGGCCT CCGAACGGTA 

70 : CTCCGCCACC GAGGGACCTG AGCGAGTCCG CATCGACCGG ATCGGAAAAC 

751 CTC T CGAGAA AGGCGTCTAA CCAGTCACAG TCGCAAGGTA GGCTGAGCAC 

8C1 CCTGGCGGGC GGCAGCGGGT GGCGGTCGGG GTTGTTTCTG GCGGAGGTGC 

851 . f GC~GATGAT GTAATTAAAG TAGGCGGTCT TGAGACGGCG GATGGTCGAG 

9U1 G T GAGGTGTG GCAGGCTTGA GATCGATCTG GCCATACACT TGAGTGACAA 

951 TGA^ATCCAC TTTGCCTTTC T CTCC ACAGG TGTCCACTCC CAGGTCCAAC 

lUUI TGG-'CCAAG CTTCGACTCG AGGA A TTCCC CGAAGGAACA AAGCACCCTC 

1051 CCCACTGGGC TCCTGGTTGC AGAGCTCCA A GTCCTCACAC AGATACGCCT 

1101 GT T T GAGA AG CAoCGGGCaa GaaaGaCGCa AGCCCAGAGG CCCTGCCATT 

1151 TCTGTGGGCT CAGGTCCCTA CTGGCTCAGG CCCCTGCCTC CCTCGGCAAG 

120 1. GCCACAATGA ACCGGGGAGT CCCTTTTAGG CACTTGCTTC TGGTGCTGCA 

1251 ACTGGCGCTC CTCCCAGCAG CCACTCAGGG AAAGAAAGTG GTGCTGGGCA 

130 1 a a A A AGGGGA TACAGTGGAA CTGACCTGTA CAGCTTCCCA GAAGAAGAGC 

3812 :35: i-iwU'Tcc ACTG 8^9^5 C 5| C ^ ACCAG ATAAAGATTC T &G& AAATCA 
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140 1 GOCCTCCTTC TTAACTAAAC gtccatccaa gctgaatgat cgcgctcact 

1451 CAAGAAGAAG cttgtgggac caaggaaact ttcccctgat catcaagaat 

1501 cttaagatag aagactcaga tacttacatc tgtgaagtgg aggaccagaa 

1551 ggaggaggtg caattgctag tgttcggatt gactgccaac tctgacaccc 

1601 ACCTGCTTCA GGGGTGATAG TAAGATCTTT GTGAAGGAAC CTTACTTCTG 

1651 TGGTGTGACA TAATTGGACA AACTACCTAC AGAGATTTAA AGCTCTAAGG 

1701 T A A AT AT A A A ATTTTTAAGT GTATAATGTG TTAAACTACT GATTCTAATT 

175 1 GTT^GTGTAT TTTAGATTCC AACCTATGGA ACTGATGAAT GGGAGCAGTG 

IdOl GTGGAATGCC TTTAATGAGG AAAACCTGTT TTGCTCAGAA GAAATGCCAT 

1851 CTAGTGATGA TGAGGCTACT GCTGACTCTC AACATTCTAC TCCTCCA AAA 

1901 A AG A A GAGA A AGGTAGAAGA CCCCAAGGAC TTTCCTTCAG AATTGCTAAG 

1951 TTTTTTGAGT CATGCTGTGT TTAGTAATAG AACTCTTGCT TGCTTTGCTA 

2001 TTTACACCAC AAAGGAAAAA GCTGCACTGC TATACAAGAA AATTATGGAA 

1:05 1 AAATATTCTG TAACCTTTAT AAGTAGGCAT AACAGTTATA ATCATAACAT 

?lUl ACTGTTTTTT cttactccac acacgcatag agtgtctgct ATTAATAACT 

2 151 ATGCTCAAAA ATTGTGTACC TTTAGCTTTT TAATTTGTAA AGGGGTTAAT 

2201 AAGGAATATT TGATGTATAG TGCCTTGACT AGAGATCATA ATCAGCCATA 

-25 1 CCACATTTGT AGAGGTTTT A CTTGCTTTAA AAAACCTCCC ACACCTCCCC 

7301 CTGAACCTGA AACATAAAAT GAATGCAATT GTTGTTGTTA ACTTGTTTAT 

2351 TGCAGCTTAT AATGGTTACA AATAAAGCAA TAGCATCACA AATTTCACAA 

2401 ATAAAGCATT TTTTTCACTG CATTCTAGTT GTGGTTTGTC CAAACTCATC 

245 1 AATGTATCTT ATCATGTCTG GATCCTCTAC GCCGGACGCA TCGTGGCCGG 

-5Gi CA-CACCGGC GCCACAGGTG CGGTTGCTGG CGCCTATATC GCCGACATCA 

2551 CCGATGGGGA AGATCGGGCT CGCCACTTCG GGCTCATGAG CGCTTGTTTC 

2601 GGCGTGGGTA TGGTGGC AGO CCCGTGGCCG GGGGACTGTT GGGCGCCATC 

2651 * CCT TGC a TG CalCaTTCCT TOCGGCGGCG GTGCTCaaCG GCCTCAACCT 

2701 ACTACTuGGC TGCTTCCTaa TGCAGGAGTC GCATAAGGGa gagcgtcgac 

275 1 CGATGCCCTT GAGAGCCTTC aaCCCaGTCa gctcctccg gtgggcgcgg 

2801 GGCATGACTA TCGTCGCCGC ACTTATGACT GTCTTCTTTA TCATGCAACT 

2851 CGTAGGACAG GTGCCGGCAG CGCTCTGGGT CATTTTCGGC GAGGACCGCT 

2901 TTCGCTGGAG CGCGACGATG ATCGGCCTGT CGCTTGCGGT ATTCGGAATC 

295 1 TTGCACGCCC TCGCTCAAGC CTTCGTCACT GGTCCCGCCA CCAAACGTTT 
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FIG. 19 (cont'd) 

305; ACGT C T TGCT GGCGTTCG-G ACGCGAGGCT GGATGGCCTT CCCCATTATG 

3101 ATTCTTCTCG CTTCCGGCGG CATCGGGATG CCCGCGTTGC AGGCCATGCT 

3151 GTCCAGGCAG GTAGATGACG ACCATCAGGG ACAGCTTCAA GGATCGCTCG 

3201 CGGCTCTTAC CAGCCT A ACT TCGATCACTG GACCGCTGAT CGTCACGGCG 

3251 ATTTATGCCG CCTCGGCGAG CACATGGAAC GGGTTGGCAT GGATTGTAGG 

3301 CGCCGCCCTA TaCCTTGTCT GCCTCCCCGC GTTGCGTCGC GGTGCATGGA 

3351 GCCGGGCCAC CTCGACCTGA ATGGAAGCCG GCGGCACCTC GCTAACGGAT 

3401 TCACCACTCC AAGAATTGGA GCCAATCAAT TCTTGCGGAG AACTGTGAAT 

:«45 1 GCGCAAACCA ACCCTTGGCA GAACATATCC ATCGCGTCCG CCATCTCCAG 

3501 CAGCCGCACG CGGCGCATCT CGGGCCGCGT TGCTGGCGTT TTTCCATAGG 

3551 CTCCGCCCCC CTGACGAGCA TCACAAAAAT CGACGCTCAA GTCAGAGGTG 

3601 GCGAAACCCG ACAGGACTAT AAAGATACCA GGCGTTTCCC CCTGGAAGCT 

3651 CCCTCGTGCG CTCTCCTGTT CCGACCCTGC CGCTTACCGG ATACCTGTCC 

37u: GCCTTTCTCC CTTCGGGAAG CGTGGCGCTT TCTCAATGCT CACGCTGTAG 

375 1 GTATCTCAGT TCGGTGTAGG TCGTTCGCTC CAAGCTGGGC TGTGTGCACG 

3801 AACCCCCCGT TCaGCCCGaC CGCTGCGCCT tatccggtaa ctatcgtctt 

3B51 GAGTCCAACC CGGTAAGACA cgacttatcg ccactggcag CAGCCACTGG 

390: TAACAGGATT AGC AGAGCGA GGTATGTAGG CGGTGCTACA GAGTTCTTGA 

J95 1 AGTGGTGGCC TAACTACGGC TACACTAGAA GGACAGTATT TGGTATCTGC 

«;60 : GCTCTGCTGA AGCCAGTTAC CTTCGGAAAA AGAGTTGGTA GCTCTTGATC 

40b 1 CGGCaaaCAA ACCACCGCTG GTAGCGGTGG TTTTTTTGTT TGCAAGCAGC 

4101 AGATTACGCG CAGAAAAAAA ggatctcaag AAGATCCTTT gatgttttct 

^151 ACGGGGTCT.G ACGCTCAGTG GAACGAAAAC TCACGTTAAG GGATTTTGGT 

4201 CATGAGATTA TCA A A AAGGA TCTTCACCTA GATCCTTTTA AATTAAAAAT 

4251 GAAG^TTTAA ATCAATCTAA AGTATATATG AGTAAACTTG GTCTGACAGT 

4301 T A cc A ATGCT taatcacxa sgcacctatc tcagcgatc t gtctatttcg 

435 1 TTCATCCATA GTTGCCTGaC TCCCCGTCGT GTAGATAACT ACGATACGGG 

44C1 AGoOCTTACC atctggcccc aGTGCTGCAA tgataccgcg agacccacgc 

4451 TCACCGGCTC CAGATTT ATC AGCAATAAAC CAGCCAGCCG GAAGGGCCGA 

4501 GCGCAGAAGT GGTCCTGCAA CTTTATCCGC CTCCATCCAG TCTATTAATT 

4551 GTTGCCGGGA AGCTAGAGTA AGTAGTTCGC CAGTTAATAG TTTGCGCAAC 

3814 4601 G — G-TGCCA TTGCTG ^^^g^J^ TG T CACGCTCGT CGTTTGGTAT 
4cSl Gl»CTTt-TTC AGCTCCGGTT CCCAACGATC AAGGCGAGTT ACATGATCCC 
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FIG. 19 (cont'd) 



4701 CCATGTTGTG CAAAAAAGCG GTTAGCTCCT TCGGTCCTCC GATCGTTGTC 

4751 AGAAGTAAGT TGGCCGCAGT GTTATCACTC ATGGTTATGG CAGCACTGCA 

4B01 TAATTCTCTT ACTGTCATGC CATCCGTAAG ATGCTTTTCT GTGACTGGTG 

4851 AGTACTCAAC CAAGTCATTC TGAGAATAGT GTATGCGGCG ACCGAGTTGC 

4901 TCTTGCCCGG CGTCAACACG GGATAATACC GCGCCACATA GCAGAACTTT 

4951 AAAAGTGCTC ATCATTGGAA AACGTTCTTC GGGGCGAAAA CTCTCAAGGA 

5001 TCTTACCGCT GTTGAGATCC AGTTCGATGT AACCCACTCG TGCACCCAAC 

5051 TGATCTTCAG CATCTTTTAC TTTCACCAGC GTTTCTGGGT GAGCAAAAAC 

5101 AGGAAGGCAA AATGCCGCAA AAAAGGGAAT AAGGGCGACA CGGAAATGTT 

5151 GAATACTCAT ACTCTTCCTT TTTCAATATT ATTGAAGCAT TTATCAGGGT 

5201 TATTGTCTCA TGAGCGGATA CATATTTGAA TGTATTTAGA AAAATAAACA 

5251 AATAGGGGTT CCGCGCACAT TTCCCCGAAA AGTGCCACCT GACGTCTAAG 

5301 AAACCATTAT TATCATGACA TTAACCTATA AAAATAGGCG TATCACGAGG 

5351 CCCTTTCGTC TTCAA 
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FIG. 20 



09396.5CQ Lengtn: 551B 

1 CAATTAATTC CAGCTTGCTG TGGAATGTGT GTCAGTTAGG GTGTGGA AAG 

51 TCCCCAGGCT CCCCAGCAGG CAGAAGTATG CAAAGCATGC ATCTCAATTA 

101 GTCAGCAACC AGGTGTGGAA AGTCCCCAGG CTCCCCAGCA GCCAGAAGTA 

151 TGCAAAGCAT GCATCTCAAT TAGTCAGCAA CCATAGTCCC GCCCCTAACT 

201 CCGCCCATCC CGCCCCTAAC TCCGCCCAGT TCCGCCCATT CTCCGCCCCA 

251 TGGCTGACTA ATTTTTTTTA TTTATGCAGA GGCCGAGGCC GCCTCGGCCT 

301 CTGAGCTATT CCAGAAGTAG TGAGGAGGCT TTTTTGGAGG GGTCCTCCTC 

351 GTATAGAAAC TCGGACCACT CTGAGACGAA GGCTCGCGTC CAGGCCAGCA 

401 CGAAGGAGGC TAAGTGGGAG GGGTAGCGGT CGTTGTCCAC TAGGGGGTCC 

451 ACTCGCTCCA GGGTGTGAAG ACACATGTCG CCCTCTTCGG CATCAAGGAA 

501 GGTGATTGGT TTATAGGTGT AGGCCACGTG ACCGGGTGTT CCTGAAGGGG 

551 GGCTATAAAA GGGGGTGGGG GCGCGTTCGT CCTCACTCTC TTCCGCATCG 

601 CTGTCTGCGA GGGCCAGCTG TTGGGCTCGC GGTTGAGGAC AAACTCTTCG 

651 CGGTCTTTCC AGTACTCTTG GATCGGAAAC CCGTCGGCCT CCGAACGGTA 

701 CTCCGCCACC GAGGGACCTG AGCGAGTCCG CATCGACCGG ATCGGAAAAC 

751 CTCTCGAGAA AGGGGTCTAA CCAGTCACAG TCGCAAGGTA GGCTGAGCAC 

801 CGTGGCGGGC GGCAGCGGGT GGCGGTCGGG GTTGTTTCTG GCGGAGGTGC 

851 TGCTGATGAT GT AATTAAAG TAGGCGGTCT TGAGACGGCG GATGGTCGAG 

901 GTGAGGTGTG GCAGGCTTGA GATCGATCTG GCCATACACT TGAGTGACAA 

951 TGACATCCAC TTTGCCTTTC TCTCCACAGG TGTCCACTCC CAGGTCCAAC 

1001 TGGATCCAAG CTTCGACTCG AGGA ATTCCC CGAAGGAACA AAGCACCCTC 

1051 CCCACTGGGC TCCTGGTTGC AGAGCTCCAA GTCCTCACAC AGATACGCCT 

1101 GTTTGAGAAG CAGCGGGCAA GA A AGACGC A AGCCCAGAGG CCCTGCCATT 
1151 TCTGTGGGCT CAGGTCCCTA CTGGCTCAGG CCCCTGCCTC CCTCGGCAAG 
1201 GCCACAATGA ACCGGGGAGT CCCTTTTAGG CACTTGCTTC TGGTGCTGCA 
1251 ACTGGCGCTC CTCCCAGCAG CCACTCAGGG AAAGAA AGTG GTGCTGGGCA 
1301 AAAAAGGGGA TACAGTGGAA CTGACCTGTA CAGCTTCCCA GAAGAAGAGC 
3816 1351 ATACAATTCC ACTGGA Q^^'jf AG ATAAACATTC TGGGAAATCA 
I4C! GCi^CTCCTTC TTAACTAAAG GTCCATCCAA GCTGAATGaT CGCGCTGACT 



FIG. 20 (cont'd) 

1451 CAACAA&AAC CTTCTGGGAC CAAGGAAACT TTCCCCTGAT CATCAAGAAT 

1501 CTTAAGATAG AAGACTCAGA TACTTACATC TGTGAAGTGG AGGACCAGAA 

1551 GGAGGAGGTG CAATTGCTAG TGTTCGGATT GACTGCCAAC TCTGACACCC 

1601 ACCTGCTTCA GGGGCAGAGC CTGACCCTGA CCTTGGAGAG CCCCCCTGGT 

1651 AGTAGCCCC T CACTGCAATG TAGGAGTCCA AGGGGTAAAA ACATACAGGG 

170 1 GGGGAAGACC CTCTCCGTGT CTCAGCTGGA GCTCCAGGAT AGTGGCACCT 

1751 GGACATGCAC TGTCTTGCAG AACTGAGATC TTTGTGAAGG AACCTTACTT 

3 801 CTGTGGTGTG ACATAATTGG ACAAACTACC TaCAGAGATT TAAAGCTCTA 

185 J AGGTAAATAT AAAATTTTTA AGTGTATAAT GTGTTAAACT ACTGATTCTA 

1901 ATTGTTTGTG TATTTTAGAT TCCAACCTAT GGAACTGATG AATGGGAGCA 

195 1 GTGGTGGAAT GCCT TTAATG AGGAAAACCT GTTTTGCTCA GAAGAAATGC 

2C0I catctagtga tgatgaggct actgctgact ctcaacattc tactcctcca 
205 1 aaaaaGAAGA gaaaggtaga agaccccaag gactttcctt cagaattgct 

2101 AAGTTTTTTG AGTCATGCTG TGTTTAGTAA TAGAACTCTT GCTTGCTTTG 

2151 ctatttacac cacaaaggaa AAAGCTGCAC tgctatacaa gaaaattatg 

2201 GA A A A A T ATT CTGTAACCTT TATAAGTAGG CATAACAGTT ATAATCATAA 

2251 CATACTGTTT TTTCTTACTC CACACAGGCA TAGAGTGTCT GCTATTAATA 

230 1 ACTATGCTCA AAA ATTGTGT ACCTTTAGCT TTTTAATTTG TAAAGGGGTT 

2351 A A T A AGGAAT ATTTGATGTA TAGTGCCTTG ACTAGAGATC ATAATCAGCC 

2401 ATACCACATT TGTAGAGGTT TTACTTGCTT TAAAAAACCT CCCACACCTC 

2451 CCCCTGAACC TGAAACATAA AATGAATGCA ATTGTTGTTG TTAACTTGTT 

2501 TATTGCAGCT TATAATGGTT ACAAATAAAG CAATAGCATC ACAAATTTCA 

2551 C A aaTAAAGC ATTTTTTTCA CTGCATTCTA GTTGTGGTTT gtccaaactc 

2601 atcaatgtat cttatcatgt ctggatcctc tacgccggac gcatcgtggc 

2651 CGGCATCACC GGCGCCACAG GTGCGGTTGC TGGCGCCTAT ATCGCCGACA 

2701 TCACCGATGG GGAAGATCGG GCTCGCCACT TCGGGCTCAT GAGCGCTTGT 

2751- TTCGGCGTGG GTATGGTGGC AGGCCCGTGG CCGGGGGACT GTTGGGCGCC 

2BG1 ATC-CCTTGC ATGCACCATT CCTTGCGGCG GCGGTGCTC A ACGGCCTCAA 

2851 CCTACTACTG GGCTGCTTCC TAATGCAGGA GTCGCAT AAG GGAGAGCGTC 

2901 GACCGATGCC CTTGAGAGCC TTCAACCCAG TCAGCTCCTT CCGGTGGGCG 

295 1 CGGGGCATGA CTATCGTCGC CGCACTTATG ACTGTCTTCT TTATCATGCA 

38 1? 3001 AC**CG T AGGA CAGGTgogp0Cgg^|CgTCTG GGTCATTTTC GGCGAGGACC 

305 1 GCTTT:a:TG GAoCOCGACG ATGATCGGCC TGTCGCTTGC GGTATTCGCA 



• W • / V4»J"^V 



62/93 FIG. 20 (cont'd) 

31.01 ATCTTGCACG CCCTCGC JXA AGCCTTCGTC ACTGGTCCCG CCACCAAACG 
3151 TTTCGGCGAG AAGCAGGCCA TTATCGCCGG C\TGGCGGCC GACGCGCTGG 
3201 GCTACGTCTT GCTGGCGTTC GCGACGCGAG GCTGGATGGC CTTCCCCATT 
3251 ATGATTCTTC TCGCTTCCGG CGGCATCGGG ATGCCCGCGT TGCAGGCCAT 
3301 GCTGTCCAoG CaGGTAGATG ACGACCATCA GGGACAGC t T CaaGGATCGC 
3351 TCGCGGCTCT TACCAGCCTA ACTTCGATCA CTGGACCGC" GATCGTCACG 
3401 GCGATTTATG CCGCCTCGGC GAGCACATGG AACGGGTTGG CaTGGATTGT 
3451 AGGCGCCGCC CTATACCTTG TCTGCCTCCC CGCGTTGCGT CGCGGTGCAT 
3501 GGAGCCGGGC CACCTCGACC TGAATGGAAG CCGGCGGCAC CTCGCTAACG 
3551 GATTCACCAC TCCAAGAATT GGAGCCAA~"C AATTCTTGCG GAGAACTGTG 
3b01 A ATGCGCAAA CCAACCCTTG GCAGAACATA TCCATCGCGT CCGCCATCTC 
3651 CAGCAGCCGC ACGCGGCGCA t C TCGGGCCG CGTTGCTGGC GTTTTTCCAT 
3701 AGGCTCCGCC CCCCTGACGA GCATCACAAA AATCGACGCT CAAGTCAGAG 
3751 GTGGCGAAAC CCGACAGGAC TATAAAGATA CCAGGCGTTT CCCCCTGGAA 
3801 GCTCCCTCGT GCGCTCTCCT GTTCCGACCC TGCCGCTTAC CGGATACCTG 
3851 TCCGCCTTTC TCCCTTCGGG AAGCGTGGCG CTTTCTCAAT GCTCACGCTG 
3901 TAGGTATCTC AGTTCGGTGT AGGTCGTTCG CTCCAAGCTG GGCTGTGTGC 
3951 ACGAACCCCC CGTTCAGCCC GACCGCTGCG CCTTATCCGG TAACTATCGT 
4001 CTTGAGTCCA ACCCGGTAAG ACACGACTTA TCGCCACTGG CAGCAGCCAC 
40S1 TGGTAACAGG ATTAGCACAG CGAGGTATGT AGGCGGTGCT ACAGAGTTCT 
4101 TGAAGTGGTG GCCTAACTAC GGCTACACTA GAAGGACAGT ATTTGGTATC 
4151 TGCGCTCTGC TGAAGCCAGT TACCTTCGGA AAAAGAGTTG GTAGCTCTTG 
4201 -aTCCGGC A A A CAAACCACCG CTGGTAGCGG TGGTTTTTTT GTTTGCAAGC 
4251 AGCAGATTAC GCGCAGAAAA AAAGGATCTC AAGAAGATCC TTTGATCTTT 
4301 TCTACGGGGT CTGACGCTCA GTGGAACGAA AACTCACGTT AAGGGATTTT 
4351 GGTCATGAGA TTATCAAAAA GGATCTTCAC CTAGATCCTT TTAAATTAAA 
4401 AATGAACTTT TAAATCAATC TAAAGTATAT ATGAGTAAAC TTGGTCTGAC 
4451 AGT-ACCAAT GCTTAATCAG TGAGGCACCT ATCTCAGCGA TCTGTCTATT 
4501 TCGTTCATCC ATAGTTGCCT GACTCCCCGT CGTGTAGATA ACTACGATAC 
4551 GGGAGGGCTT ACCATCTGGC CCCAGTGCTG CAATGATACC GCGAGACCCA 
4601 CGCTCACCGG CTCCAGATTT ATCAGCAATA A ACCAGCCAG CCGGAAGGGC 
4651 CGAGCGCAGA ACTCG Wfi tf£^J T J CCCCTCCATC CAGTCTATTA 
A-C.\ iT-iTTGCCG GGAAGCTAGA GTAAGTAGTT CGCCAGTTAA TAGTTTGCGC 
3R1R 
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FIG. 20 (cont'd) 
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PBG3M r- 9 r% 4*% i 

:BQ366 backDone f» I (m / 

rtoluDlt T4#8 ' # W * ^ # 

:AA #3 = LVS 

:*'p«rf«ct" Stu/f <r$t 182 A A of T4 
:Das1ca1 ly up to V2J2 

DQ393.seq Length: 5566 

1 GAATTAATTC CAGCTTGCTG TGGAATGTGT GTCAGTTAGG GTGTGGAAAG 

5X TCCCCAGGCT CCCCAGCAGG CAGAAGTATG CAAAGCATGC ATCTCAATTA 

101 GTCAGCAACC AGGTGTGGAA AGTCCCCAGG CTCCCCAGCA GGCAGAAGTA 

151 TGCAAAGCAT GCATCTCAAT TAGTCAGCAA CCATAGTCCC GCCCCTAACT 

201 CCGCCCATCC CGCCCCTAAC TCCGCCCAGT TCCGCCCATT CTCCGCCCCA 

251 TGGCTGACTA ATTTTTTTTA TTTATGCAGA GGCCGAGGCC GCCTCGGCCT 

301 CTGAGCTATT CCAGAAGTAG TGAGCAGGCT TTTTTGGAGG GGTCCTCCTC 

351 GTATAGAAAC TCGGACCACT CTGAGACGAA GGCTCGCGTC CAGGCCAGCA 

401 CGAAGGAGGC TAAGTGGGAG GGGTAGCGGT CGTTGTCCAC TAGGGGGTCC 

451 ACTCGCTCCA GGGTGTGAAG ACACATGTCG CCCTCTTCGG CATCAAGGAA 

501 GGTGATTGGT TTATAGGTGT AGGCCACGTG ACCGGGTGTT CCTGAAGGGG 

551 GGCTATAAAA GGGGGTGGGG GCGCGTTCGT CCTCACTCTC TTCCGCATCG 

601 CTGTCTGCGA GGGCCAGCTG TTGGGCTCGC GGTTGAGGAC AAACTCTTCG 

651 CGGTCTTTCC AGTACTCTTG GATCGGAAAC CCGTCGGCCT CCGAACGGTA 

701 CTCCGCCACC GAGGGACCTG AGCGAGTCCG CATCGACCGG ATCGGAAAAC 

751 . CTCTCGAGAA AGGCGTCTAA CCAGTCACAG TCGCAAGGTA GGCTGAGCAC 

801 CGTGGCGGGC GGCAGCGGGT GGCGGTCGGG GTTGTTTCTG GCGGAGGTGC 

851 • TGCTGATGAT GTAATTAAAG TAGGCGGTCT TGAGACGGCG GATGGTCGAG 

901 GTGAGGTGTG GCAGGCTTGA GATCGATCTG GCCATACACT TGAGTGACAA 

951 TGACATCCAC TTTGCCTTTC TCTCCACAGG TGTCCACTCC CAGGTCCAAC 

1001 TGGATCCAAG CTTCGACTCG AGGAATTCCC CGAAGGA ACA AAGCACCCTC 

105 1 CCCACTGGGC TCCTGGTTGC AGAGCTCCA A GTCCTCACAC AGATACGCCT 

1101 GTTTGAGAAG CAGCGGGCAA GAAAGACGCA AGCCCAGAGG CCCTGCCATT 

1151 TCTGTGGGCT CaGGTCCCTa CTGGCTCAGG CCCCTGCCTC CCTCGGCAAG 

1201 GCCACAATGA ACCGGGGAGT CCCTTTTAGG CACTTGCTTC TGGTGCTGCA 

1251 ACTGCCGCTC CTCCCAGCAG CCACTCAGGG AAAGAAAGTG GTGCTGGGCA 

1301 AAAAAGGGGA TACAGT ^t)|)^^^ < 2 C ^ GTA CAGCTTCCCA GAAGAAGAGC 

3 8 20 1351 ATA C A A TTC C actggaaaaa CTCCAACCAG ataaagattc TGGGAAATCA 



4,5/53 FIG. 21 (cont'd) 

1401 CGGCTCCTTC TTAACTAAAC GTCCATCCAA GCTGAATGAT CGCGCTGACT 

1451 CAAGAAGAAG CTTGTGGGAC CAAGGAAAwT TTCCCCTGAT CATCA AGAAT 

1501 CTTAAGATAG AAGACTCAGA TACTTACATC TGTGAAGTGG AGGACCAGAA 

1551 GGAGGAGGTG CAATTGCTAG TGTTCGGATT GACTGCCAAC TCTGACACCC 

1601 ACCTGCTTCA GGGGCAGAGC CTGACCCTGA CCTTGGAGAG CCCCCCTGGT 

1651 AGTAGCCCCT CAGTGCAATG TAGGAGTCCA AGGGGTAAAA ACATACAGGG 

1701 GGGGAAGACC CTCTCCGTGT CTCAGCTGGA GCTCCAGGAT AGTGGCACCT 

1751 GGACATGCAC TGTCTTGCAG AACCAGAAGA AGGTGGAGTT CAAAATAGAC 

1801 ATCGTGGTGC TAGCTTTCCA GTGAGATCTT TGTGAAGGAA CCTTACTTCT 

1851 GTGGTGTGAC ATAATTGGAC AAACTACCTA CAGAGATTTA AAGCTCTAAG 

1901 GT A A AT AT A A AATTTTTAAG TGTATAATGT GTTAAACTAC TGATTCTAAT 

1951 TGTTTGTGTA TTTTAGATTC CAACCTATGG AACTGATGAA TGGGAGCAGT 

2001 GGTGGAATGC CTTTAATGAG GAAAACCTGT TTTGCTCAGA AGAAATGCCA 

2051 TCTAGTGATG ATGAGGCTAC TGCTGACTCT CAACATTCTA CTCCTCCAAA 

2101 A A AG A A GAGA AAGGTAGAAG ACCCCAAGGA CTTTCCTTCA GAATTGCTAA 

2151 GTTTTTTGAG TCATGCTGTG TTTAGTAATA GAACTCTTGC TTGCTTTGCT 

2201 ATTTACACCA CA A AGGA AAA AGCTGCACTG CTATACAAGA AAATTATGGA 

2251 AAAATATTCT GTA ACCTTT A TAAGTAGGCA TAACAGTTAT AATCATAACA 

2301 TACTGTTTTT TCTTACTCCA CACAGGCATA GAGTGTCTGC TATTAATAAC 

2351 T ATGCTC AAA AATTGTGTAC CTTTAGCTTT TTAATTTGTA AAGGGGTTAA 

2401 TAAGGAATAT TTGATGTATA GTGCCTTGAC TAGAGATCAT AATCAGCCAT 

2451 ACCACATTTG TAGAGGTTTT ACTTGCTTTA AAAA ACCTCC CACACCTCCC 

2501 CC-TGAACCTG AA AC ATA AAA TGAATGCAAT TGTTGTTGTT AACTTGTTTA 

2551 TTGCAGCTTA TAATGGTTAC AAATAAAGCA ATAGCATCAC AAATTTCACA 

2601 AATAAAGCAT TTTTTTCACT GCATTCT AGT TGTGGTTTGT CCAA ACTCAT 

2651 CAA^GTATCT TATCATGTCT GGATCCTCTA CGCCGGACGC ATCGTGGCCG 

270*1 GCATCACCGG CGCCACAGGT GCGGTTGCTG GCGCCTATAT CGCCGACATC 

275 1 ACCGATGGGG AAGt'CGGGC TCGCCACTTC GGGCTCATGA GCGCTTGTTT 

2801 CGGCGTGGGT ATGGTGGCAG GCCCGTGGCC GGGGGACTGT TGGGCGCCAT 

2851 CTCCTTGCAT GCACCATTCC TTGCGGCGGC GGTGCTCA AC GGCCTCAACC 

2901 TACTACTGGG CTGCTTCCTA ATGCAGGAGT CGCATAAGGG AGAGCGTCGA 

295 1 CCGATGCCCT TGACA ^g^ g^J^f §^ GTC A ^CTCCTTCC GGTGGGCGCG 

38 2 1 ioc: gcgcatcact ATCG^CGCCG cacttatgac tgtcttcttt ATCATGCAAC 
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4701 AGCGCAGAAG TGGTCCTGCA ACTTTATCCG CCTCCATCCA GTCTATTAAT 

4751 TGTTGCCGGG AAGCTAGAGT AAGTAGTTCG CCAGTTAATA GTTTGCGCAA 

4801 CGTTGTTGCC ATTGCTGCAG GCATCGTGGT GTCACGCTCG TCGTTTGGTA 

4851 TGGCTTCATT CAGCTCCGGT TCCCAACGAT CAAGGCGAGT TACATGATCC 

4901 CCCATGTTGT GCAAAAAAGC GGTTAGCTCC TTCGGTCCTC CGATCGTTGT 

4951 CAGAAGTAAG TTGGCCGCAG TGTTATCACT CATGGTTATG GCAGCACTGC 

5001 ATAATTCTCT TACTGTCATG CCATCCGTAA GATGCTTTTC TGTGACTGGT 

5051 GAGTACTCAA CCAAGTCATT CTGAGAATAG TGTATGCGGC GACCGAGTTG 

5101 CTCTTGCCCG GCGTCAACAC GGGATAATAC CGCGCCACAT AGCAGAACTT 

5151 TAAAAGTGCT CATCATTGGA AAACGTTCTT CGGGGCGAAA ACTCTCAAGG 

5201 ATCTTACCGC TGTTGAGATC CAGTTCGATG TAACCCACTp GTGCACCCAA 

5251 CTGATCTTCA GCATCTTTTA CTTTCACCAG CGTTTCTGGG TGAGCAAAAA 

5301 CAGGAAGGCA AAATGCCGCA AAAAAGGGAA TAAGGGCGAC ACGGAAATGT 

5351 TGAATACTCA TACTCTTCCT TTTTCAATAT TATTGAAGCA TTTATCAGGG 

5401 TTATTGTCTC ATGAGCGGAT ACATATTTGA ATGTATTTAG AAAAATAAAC 

5451 AAATAGGGGT TCCGCGCACA TTTCCCCGAA AAGTGCC ACC TGACGTCTAA 

5501 GAAACCATTA TT ATCATGAC ATTAACCTAT AAAAAT AGGC GTATCACGAG 

5551 GCCCTTTCGT CTTCAA 
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3101 GTCTTGCTGG CGTTCGCGAT'C^TGbCTGG' ATGfeCCTTCC CCATTATGAT 

3151 TCTTCTCGCT TCCGGCGGCA TCGGGATGCC CGCGTTGCAG GCCATGCTGT 

3201 CCAGGCAGGT AGATGACGAC CATCAGGGAC AGCTTCAAGG ATCGCTCGCG 

325J GCTCTTACCA GCCTaaCTTC GATCACTGGA CCGCTGATCG TCACGGCGAT 

33C1 TTaTGCCGCC TCGGCGAGCA CATGGAACGG GTTGGCATGG ATTGTAGGCG 

335 1 CCGCCCTATA CCTTGTCTGC CTCCCCGCGT TGCGTCGCGG TGCATGGAGC 

3401 CGGGCCACCT CGACCTGAAT GGAAGCCGGC GGCACCTCGC TAACGGATTC 

3^51 ACCACTCCAA GAATTGGAGC CAATCAATTC TTGCGGAGAA CTGTGAATGC 

3501 GCAAACCAAC CCTTGGCAGA ACATATCCAT CGCGTCCGCC ATCTCCAGCA 

3551 GCCGCACGCG GCGCATCTCG GGCCGCGTTG CTGGCGTTTT TCCATAGGCT 

3601 CCGCCCCCCT GACGAGCATC ACAAAAATCG ACGCTCAAGT CAGAGGTGGC 

3651 GAAACCCGAC AGGACTATAA AGATACCAGG CGTTTCCCCC TGGAAGCTCC 

3701 CTCGTGCGCT CTCCTGTTCC GACCCTGCCG CTTACCGGAT ACCTGTCCGC 

3751 CTTTCTCCCT TCGGGAAGCG TGGCGCTTTC TCAATGCTCA CGC T GTAGG T 

3B01 ATCTCAGTTC GGTGTAGGTC GTTCGCTCCA AGCTGGGCTG ▼GTGCACGAA 

3ti5l CCCCCCGTTC AGCCCGACCG CTGCGCCTTA TCCGGTAACT ATCGTCTTGA 

3901 G^CCAACCCG GTaaGACaCG ACttaTCGCC ACTGGCAGCA gccactggta 

395 1 ACAGGATTAG CaGaoCGAGG TATGTAGGCG GTGCTACAGA GTTCTTGAAG 

4U0! 1GG1GGCCTA AC^ACGGCTA CACTAGAAGG ACAGTATTTG GTATCTGCGC 

4051 TC^GCTGAAG CCAGTTACCT TCGGAAAAAG AGTTGGTAGC TCTTGATCCG 

4 10 1 GCAAACAAAC CACCGCTGGT AGCGGTGGTT TTTTTGTTTG CAAGCAGCAG 

4 15 1 -TTACGCGCA G A A A A A A A GG ATCTCAAGAA JGATCCTTTGA TCTTTTCTAC 

4201 GGGGTCTGAC GCTCAGTGGA ACGAAAACTC ACGTTAAGGG ATTTTGGTCA 

425 1 TGAGATTATC A A A A AGG.ATC TTCACCTAGA TCCTTTT AAA TTA AAA ATGA 

430! AGTTTTAAAT CAATCTAAAG TATATATGAG TAAACTTGGT CTGACAGTTA 

435 1 CC--TGCTTA ATCAGTGAGG CACCTATCTC AGCGATCTGT CTATTTCGTT 

U4j! CiJCCATAG"T TGCCTGACTC CCCGTCGTGT AGAT A ACT A C GATACGGGAG 

4^51 GCCTTACCAT CTGGCCCCAG TGCTGCAATG ATACCGCGAG ACCCACGCTC 

45CJ ACCGGCTCCA GATTTATCAG CAATAAACCA GCCAGCCGGA AGGGCCGAGC 

4551 GCAGAAGTGG TCCTGCAACT TTATCCGCCT CCATCCAGTC TATTAATTGT 

4601 TGCCGGGAAG CTaGaGTAAG TAGTTCGCCA GTTAATAGTT TGCGCAACGT 

465: t:t-c:a-t gc": agg^^ (j^^J^ 0 ACGCTCG ~ CG tttggtatgg 

^70 ■ C1TC- T TC«3 CTCC jd z : CaaCGatcaa ggcgag^^ac atgaTCCCCC 
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